Improvements for Dataset & DOI handling #1338

adam3smith · Jun 19, 2017

Note for SAE: this appears currently broken due to invalid HTML
@zuphilip if you have time to review, that'd be great

zuphilip · Jun 19, 2017

zuphilip reviewed Jun 19, 2017

View changes

zuphilip · Jun 19, 2017

RSC Publishing.js

+					item.extra += "\nDOI: " + DOI;
+				}
+			} else {
+				item.extra = "DOI: " + DOI;


This should not be needed because it should be already covered by the changes in EM translator.

it's needed because this is recognized by EM as a journal article so it doesn't move the DOI. We change that above.

We could also change the itemType in the translatorObject cf. https://github.com/zotero/translators/blob/master/Embedded%20Metadata.js#L815 and then (I assume) it is not necessary anymore. I.e.

translator.getTranslatorObject(function(trans) { trans.itemType = type; trans.doWeb(doc, url); });

zuphilip · Jun 19, 2017

Figshare.js

-					item.extra = "DOI:" + DOI;
+				else {
+					if (item.extra) {
+						if (item.extra.search(/^DOI:/) == -1) {


Maybe we could here even do

if (item.extra.search(/(^|\n)DOI:/) == -1) {

I haven't tested, but shouldn't ^DOI find the string at the beginning of any new line?

Hm... this could also be...

No, you were right; changing

zuphilip · Jun 19, 2017

Figshare.js

+						}
+					} else {
+						item.extra = "DOI: " + DOI;
+					}


Wouldn't it be better to do this in the called translator rather then the special one here?

turns out the whole passage isn't necessary -- the DOI is in the RIS and the RIS translator does handle this now.

zuphilip · Jun 19, 2017

RDF.js

+		else newItem.extra = "type: dataset";
+	}
+
+


Why did you do this change in the RDF translator? Is this the best place for this?

because the RDF translator handles all other dublin core code.

I haven't thought this through, but RDF is usually called by EM. Is there also another (common) use case for RDF translator?

Common no. General -- other RDF imports, including Zotero RDF, DC RDF, and Bibliontology RDF. Arguably those should all recognize the (valid) DC type dataset.

zuphilip · Jun 19, 2017

RDF.js

+		if (newItem.extra) {
+			newItem.extra += "\ntype: dataset";
+		}
+		else newItem.extra = "type: dataset";


Shouldn't this be newItem.extra = "itemType: dataset" as documented here https://www.zotero.org/support/dev/translators/datasets ? Otherwise we should update the documentation.

again, I need to test this, but I'm pretty sure type is correct here, as this is used by citeproc and hence relies on CSL json (which uses type, not itemType).

here I was right: testing shows that "type: dataset" works, itemType: dataset doesn't. Changed in the documentation

zuphilip · Jun 19, 2017

Zenodo.js

+
+			//something is odd with zenodo's author parsing; fix it
+			for (var i = 0; i< item.creators.length; i++) {
+				if (!item.creators[i].firstName && item.creators[i].lastName.indexOf(",")!=-1) {


The authors in CSL JSON look fine for me and also importing the CSL JSON works for me correctly. Can you give an example where this fix is needed?

https://www.zenodo.org/record/569304/export/csl#.WUgtNsaQy70
but it's also in one of the tests. I don't understand the problem -- have double-checked data entry for this one and we did that exactly as recommended.

Okay, I haven't seen this example. But I cannot find it in any test case...

(BTW I read the comment above also, that there is always something wrong with zenodo's author metadata, which is not the case.)

zuphilip · Jun 19, 2017

Zenodo.js

+var zoteroType = {
+	"figure": "artwork",
+	 "article": "report"
+}


I would suggest to move this to the place below where it is actually used. Otherwise I will have forgotten what this is when reading the code below...

zuphilip · Jun 19, 2017

Zenodo.js

+var zoteroType = {
+	"figure": "artwork",
+	 "article": "report"
+}


What about data set?

see above -- I'm leaving that as document on purpose (though feel free to disagree

zuphilip · Jun 19, 2017

Zenodo.js

+			if (abstract) item.abstractNote = abstract;
+			if (item.itemType == "document" && zoteroType[type]) {
+				item.itemType = zoteroType[type];
+			}


Maybe add a comment here, what these lines are doing and why.

zuphilip · Jun 19, 2017

Embedded Metadata.js

+	// Add DOI to non-supported item types
+	if (newItem.DOI && !ZU.fieldIsValidForType("DOI", newItem.itemType)) {
+		if (newItem.extra){
+			newItem.extra += "\nDOI: " + newItem.DOI;


Maybe also check that we only do this once, i.e.

if (item.extra.search(/(^|\n)DOI:/) == -1) {

as below.

Since there is no way for the Extra field to contain anything other than note (given the CSL import from Zotero) I just empty the field above and can do without all this code.

adam3smith · Jun 20, 2017

Thanks, updated with your comments.

zuphilip · Jun 20, 2017

zuphilip reviewed Jun 20, 2017

View changes

zuphilip · Jun 20, 2017

Zenodo.js

-		trans.itemType = type;
-		trans.doWeb(doc, url);
+
+			if (!item.DOI) {


if (!item.DOI && doi) {

zuphilip · Jun 20, 2017

This looks good. Just some suggestions/comments/replies.

Improvements for Dataset & DOI handling
Note for SAE: this appears currently broken due to invalid HTML

Loading status checks…

630c397

Address @zuphilip's comments

Loading status checks…

2d1cdd9

Last nits from @zuphilip

Loading status checks…

2b37520

adam3smith merged commit be399e3 into zotero:master Jun 22, 2017
1 check passed

1 check passed

continuous-integration/travis-ci/pr The Travis CI build passed
Details

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

Improvements for Dataset & DOI handling (zotero#1338)
Note for SAE: this appears currently broken due to invalid HTML

0e92596

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

Improvements for Dataset & DOI handling (zotero#1338)
Note for SAE: this appears currently broken due to invalid HTML

c2ebf14

zotero/translators

Join GitHub today

Improvements for Dataset & DOI handling #1338

Conversation

adam3smith commented Jun 19, 2017

zuphilip reviewed Jun 19, 2017 View changes

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

adam3smith commented Jun 20, 2017

zuphilip reviewed Jun 20, 2017 View changes

This comment has been minimized.

This comment has been minimized.

zuphilip commented Jun 20, 2017

Hide details View details adam3smith merged commit be399e3 into zotero:master Jun 22, 2017 1 check passed

1 check passed

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

zuphilip reviewed Jun 19, 2017

View changes

zuphilip reviewed Jun 20, 2017

View changes

adam3smith merged commit `be399e3` into zotero:master Jun 22, 2017
1 check passed