New Emerald Insight Translator #2037

adam3smith · 2019-10-24T02:49:41Z

Closes #2036

@dstillman do we still need to include attr and text code in translators? If not, can we add them to linting as defined?


        New Emerald Insight Translator

Closes #2036

dstillman · 2019-10-24T02:50:58Z

do we still need to include attr and text code in translators? If not, can we add them to linting as defined?

We do, unfortunately.

zuphilip

This looks great! I have just some comments to further improve this as well as some questions for you.

zuphilip · 2019-10-26T09:49:00Z

Emerald Insight.js

+		if (!rows.length) {
+			// book
+			rows = doc.querySelectorAll('li.intent_book_chapter>a');
+		}


You can also combine two CSS paths with a comma, i.e.

rows = doc.querySelectorAll('.intent_issue_item h4>a, li.intent_book_chapter>a');

Or will the other path in the wrong case give unwanted results?

zuphilip · 2019-10-26T09:49:00Z

Emerald Insight.js

+
+
+function scrape(doc, url) {
+	var DOI = url.match(/\/(10\.[^#?/]+\/[^#?/]+)\//)[1];


It is (theoretically) possible that this does not match. Is it better to fail in such cases or should we prevent them from the following lines by checking that the url matches already in the detectWeb?

improved check in detectWeb and added one more fallback here. Also added the scrape function to work on PDF pages.

zuphilip · 2019-10-26T09:49:00Z

Emerald Insight.js

+		// they number authors in their RIS...
+		text = text.replace(/A\d+\s+-/g, "AU  -");
+		// add a comma after the last name
+		text = text.replace(/AU\s\s-\s(\w+)/g, "AU  - $1,");


I think we should test here first that no comma is present at the moment (because they might fix that at some point). Moreover, can you extend the comment for this a little more?

It seems to me that they are always treating the last word of the author string as the last name and everything else as the first name(s), which they then put into RIS data as "lastname firstname(s)" without a comma. The splitting does not need to be correct, e.g. https://www.emerald.com/insight/content/doi/10.1108/S1572-832320170000026008/full/html , but it is the only thing we can get from their metadata.

Done and done.

zuphilip · 2019-10-26T09:49:00Z

Emerald Insight.js

+						"mimeType": "application/pdf"
+					}
+				],
+				"tags": [],


There are keywords which could be saved as tags, i.e. look for doc.querySelectorAll('li .intent_text');.

Done. Won't use when scraping from PDF view which I think is acceptable.

zuphilip · 2019-10-26T09:49:00Z

Emerald Insight.js

+						"creatorType": "author"
+					}
+				],
+				"date": "January 1, 2017",


Should we make this language independent with ZU.strToISO? (Maybe even in the RIS translator itself?)

I'll impement this here -- let's think through some RIS scenarios together. Generally sounds like we should.

zuphilip · 2019-10-26T09:49:00Z

Emerald Insight.js

+				],
+				"date": "January 1, 2015",
+				"ISBN": "9781784415877 9781784415884",
+				"abstractNote": "AbstractOriginality/value\nThis technique creates opportunities for students to have unique assignments encouraging student to student teaching and can be applied to assignments in any accounting course (undergraduate and graduate). This testing method has been used in Intermediate I and II, Individual Taxation, and Corporate Taxation.",


Maybe it is cleaner to take here

ZU.cleanInternal(doc.getElementById('abstract').textContent);

for the abstract. See also https://www.emerald.com/insight/content/doi/10.1108/07419051111154758/full/html for more subcontent under abstract.

Done (won't use this when scraping from PDF, which I deemed acceptable)

Emerald Insight.js


        Fixes based on review

adam3smith · 2019-10-28T02:23:36Z

Thanks -- take another look please if you have a moment.

zuphilip · 2019-10-28T07:08:44Z

That all looks good, but can you update the test cases as well?

New Emerald Insight Translator

Loading status checks…

9636d4a

Closes #2036

adam3smith requested a review from zuphilip Oct 24, 2019

zuphilip reviewed Oct 26, 2019

View changes

Fixes based on review

Loading status checks…

9742866

Please note that GitHub no longer supports your web browser.

zotero/translators

New Emerald Insight Translator #2037

New Emerald Insight Translator #2037

adam3smith commented Oct 24, 2019

This comment has been minimized.

dstillman commented Oct 24, 2019

zuphilip left a comment

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

adam3smith commented Oct 28, 2019

This comment has been minimized.

zuphilip commented Oct 28, 2019

Please note that GitHub no longer supports your web browser.

zotero/translators

Join GitHub today

New Emerald Insight Translator #2037

Conversation

adam3smith commented Oct 24, 2019

This comment has been minimized.

dstillman commented Oct 24, 2019

zuphilip left a comment

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

adam3smith commented Oct 28, 2019

This comment has been minimized.

zuphilip commented Oct 28, 2019