Join GitHub today
GitHub is home to over 40 million developers working together to host and review code, manage projects, and build software together.
Sign upResolve all Zika authors with 10 or more publications on https://tools.wmflabs.org/scholia/topic/Q202864/missing #1102
Comments
This comment has been minimized.
This comment has been minimized.
Top of the list is https://tools.wmflabs.org/author-disambiguator/?doit=Look+for+author&name=Michael%20McCarthy with 22 publications, and since Michael McCarthy is a very common name, I'm filtering it down to just items about Zika: https://tools.wmflabs.org/author-disambiguator/?name=Michael+McCarthy&doit=Look+for+author&filter=wdt%3AP921+wd%3AQ202864 . |
This comment has been minimized.
This comment has been minimized.
The list also contains 12 papers by Manon Vouga [Q28051248], for whom |
This comment has been minimized.
This comment has been minimized.
Doing all 37 author strings through comments in this thread may become unwieldy, so I may start to set up individual tickets for each of them. |
This comment has been minimized.
This comment has been minimized.
The batches https://tools.wmflabs.org/quickstatements/#/batch/6723 and https://tools.wmflabs.org/quickstatements/#/batch/6725 for Michael McCarthy and Manon Vouga have now finished. Their Scholia profiles:
At https://tools.wmflabs.org/scholia/authors/Q24532211,Q60332340,Q28051248,Q32642588 , there is a combined view of both of them, with David Safronetz (whose batch is still queued) and Van-Mai Cao-Lormeau for comparison. |
This comment has been minimized.
This comment has been minimized.
I am running an additional batch for "M Vouga" papers at https://tools.wmflabs.org/quickstatements/#/batch/6730 . |
This comment has been minimized.
This comment has been minimized.
One way to reduce the number of P2093 statements in the corpus is to run SourceMD over items that have P2093 statements, in the hope that the tool might find the affected papers in some ORCID records. I have done this multiple times in the past, but since both the corpus and the ORCID records are constantly evolving, now might be a good time to do it again. For simplicity,I will use this query that I am exploring for a Listeria list. |
This comment has been minimized.
This comment has been minimized.
That SourceMD batch is now up at https://tools.wmflabs.org/sourcemd/?action=batch&batch=5176 . |
This comment has been minimized.
This comment has been minimized.
That batch has finished and resulted in 92 edits but no effects on the remaining 33 strings with 10 or more Zika publications. |
Daniel-Mietchen
added
the
scholia
label
Jan 3, 2019
This comment has been minimized.
This comment has been minimized.
The query from the first comment in this thread now has 19 results. |
This comment has been minimized.
This comment has been minimized.
The query results are down to 10, and all of these currently have 10 papers. |
This comment has been minimized.
This comment has been minimized.
There are 5 author name strings left with more than 9 papers (all of them have 10). |
This comment has been minimized.
This comment has been minimized.
No author name strings left with 10 or more papers, but 22 with 9. |
This comment has been minimized.
This comment has been minimized.
By now, we have no author name strings left with 8 or more occurrences, and 55 with 7. |
This comment has been minimized.
This comment has been minimized.
I also checked https://www.wikidata.org/wiki/Wikidata:WikiProject_Zika_Corpus/Listeria/Missing_authors , which shows basically the same information (I have some batches running right now), so I think this ticket is ripe to be closed, knowing that curation will need to go on as new publications are being indexed. |
Daniel-Mietchen commentedJan 2, 2019
•
edited
Currently, the query
yields 37 results.
I tried to set up a Listeria list for this (with a threshold of 5) at https://www.wikidata.org/wiki/Wikidata:WikiProject_Zika_Corpus/Listeria/Missing_authors but did not get it to work properly.