Fix recognition of articles and getting PDFs in Nature #1766

adam3smith · Oct 12, 2018

(this was reported on the Zotero forums, but can't find it (?)

adam3smith · Oct 12, 2018

adam3smith reviewed Oct 12, 2018

View changes

Nature Publishing Group.js

zuphilip · Oct 12, 2018

I guess that it fixes #1763 and maybe this is the report you remembered?

zuphilip · Oct 12, 2018

zuphilip reviewed Oct 12, 2018

View changes

Okay, looks fine. Some small comments and suggestions. The most fragile thing is IMO the order of the attribute values of data-track-action which we are depending on several places. However, we can hope that the order is fixed in their sites (at least I would assume that, as long as we don't hear anything else).

zuphilip · Oct 12, 2018

Nature Publishing Group.js

-		if (!item.attachments) {
+		var hasPDF = false;
+		for (let attach of item.attachments){
+			if (attach.title.includes("PDF")) {


Let us filter on the mimeType == "application/pdf" instead.

zuphilip · Oct 12, 2018

Nature Publishing Group.js

+			}
+		}
+		if (!hasPDF) {
+			item.attachments=[];


That line seems negligible, because its value is overwritten by the next line.

zuphilip · Oct 12, 2018

Nature Publishing Group.js

 	var m = url.match(/(^[^#?]+\/)(?:full|abs)(\/[^#?]+?\.)[a-zA-Z]+(?=$|\?|#)/);
 	if (m && m.length) return m[1] + 'pdf' + m[2] + 'pdf';
+	else if (attr(doc, 'a[data-track-action="download pdf"]', 'href')) {
+		return attr(doc, 'a[data-track-action="download pdf"]', 'href');


Is the check before here needed? attr should return null if the element is not found, which should be okay here.

zuphilip · Oct 12, 2018

Nature Publishing Group.js

@@ -354,7 +361,7 @@ function scrapeRIS(doc, url, next) {
 	if (!risURL) risURL = doc.evaluate('//li[@class="download-citation"]/a', doc, null, XPathResult.ANY_TYPE, null).iterateNext();
 	if (!risURL) risURL = doc.evaluate('//a[normalize-space(text())="Export citation" and not(@href="#")]', doc, null, XPathResult.ANY_TYPE, null).iterateNext();
 	if (!risURL) risURL = ZU.xpath(doc, '//ul[@data-component="article-info-list"]//a[@data-track-source="citation-download"]')[0];
-
+	if (!risURL) risURL = doc.querySelectorAll('a[data-track-action="download article citation"]')[0];


For the first element you can also use .querySelector cf. https://developer.mozilla.org/de/docs/Web/API/Document/querySelector

adam3smith · Oct 12, 2018

OK, all done

zuphilip · Oct 13, 2018

Thank you very much!

Fix recognition of articles and getting PDFs in Nature

Loading status checks…

d66bf1d

adam3smith requested a review from zuphilip Oct 12, 2018

Update after review

Loading status checks…

73ac959

zuphilip merged commit 4381a20 into zotero:master Oct 13, 2018
1 check passed

1 check passed

continuous-integration/travis-ci/pr The Travis CI build passed
Details

zuphilip referenced this pull request Oct 13, 2018
Closed
PDF download missing for Nature Scientific Data #1763

zotero/translators

Fix recognition of articles and getting PDFs in Nature #1766

Fix recognition of articles and getting PDFs in Nature #1766

adam3smith commented Oct 12, 2018

adam3smith requested a review from zuphilip Oct 12, 2018

adam3smith reviewed Oct 12, 2018

View changes

This comment has been minimized.

zuphilip commented Oct 12, 2018

zuphilip reviewed Oct 12, 2018

View changes

zuphilip left a comment

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

adam3smith commented Oct 12, 2018

zuphilip merged commit `4381a20` into zotero:master Oct 13, 2018
1 check passed

1 check passed

This comment has been minimized.

zuphilip commented Oct 13, 2018

zuphilip referenced this pull request Oct 13, 2018

PDF download missing for Nature Scientific Data #1763

zotero/translators

Join GitHub today

Fix recognition of articles and getting PDFs in Nature #1766

Conversation

adam3smith commented Oct 12, 2018

adam3smith requested a review from zuphilip Oct 12, 2018

adam3smith reviewed Oct 12, 2018 View changes

This comment has been minimized.

zuphilip commented Oct 12, 2018

zuphilip reviewed Oct 12, 2018 View changes

zuphilip left a comment

This comment has been minimized.

zuphilip Oct 12, 2018

This comment has been minimized.

zuphilip Oct 12, 2018

This comment has been minimized.

zuphilip Oct 12, 2018

This comment has been minimized.

zuphilip Oct 12, 2018

This comment has been minimized.

adam3smith commented Oct 12, 2018

Hide details View details zuphilip merged commit 4381a20 into zotero:master Oct 13, 2018 1 check passed

1 check passed

This comment has been minimized.

zuphilip commented Oct 13, 2018

zuphilip referenced this pull request Oct 13, 2018

PDF download missing for Nature Scientific Data #1763

adam3smith reviewed Oct 12, 2018

View changes

zuphilip reviewed Oct 12, 2018

View changes

zuphilip merged commit `4381a20` into zotero:master Oct 13, 2018
1 check passed