Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Build a variant that helps disambiguate authors of a single publication #67

Open
Daniel-Mietchen opened this issue May 31, 2019 · 7 comments

Comments

@Daniel-Mietchen
Copy link
Contributor

commented May 31, 2019

Here, I am thinking of something like
https://tools.wmflabs.org/author-disambiguator/?fuzzy=0&limit=1&name=Li%20Li
but instead of the name parameter, it would be called via a work parameter that expects the QID of a given work (optionally via DOI/ PMID/ PMCID/ arXiv ID etc., e.g. by using the Wikidata Hub).

This would address the same problem as #66 but should be more straightforward to implement.

@arthurpsmith

This comment has been minimized.

Copy link
Owner

commented May 31, 2019

The idea being, you start from that publication and the links from the author name strings would go to name pages for those names? I'll have to think about it, but in principle I think it makes sense... I'm afraid I've been a bit busy with other priorities but I do plan to get back to updating this tool in the next week or two...

@Daniel-Mietchen

This comment has been minimized.

Copy link
Contributor Author

commented Jun 13, 2019

Yes, the key idea is to have a page for one work which ideally lists all author name strings for that work and links them to the corresponding name pages.

This would look very similar to the example given above, with the main difference being that right now, the limit=1 part does not give the user control over which work to display (for me, it's https://www.wikidata.org/wiki/Q21131061 right now), whereas the new mechanism would allow to specify something like &work=Q21131061 or similar.

@arthurpsmith arthurpsmith self-assigned this Aug 8, 2019

@arthurpsmith

This comment has been minimized.

Copy link
Owner

commented Aug 8, 2019

Ok, there's a new "work_item.php" page - for example here:
https://tools.wmflabs.org/author-disambiguator/work_item.php?id=Q27335792&doit=Get+author+links+for+work

I should probably link it from other pages. Any suggestions on what else should be done with it?

@Daniel-Mietchen

This comment has been minimized.

Copy link
Contributor Author

commented Aug 9, 2019

I had a first go at it from your example, which led to
https://tools.wmflabs.org/quickstatements/#/batch/16968
and thus
https://tools.wmflabs.org/scholia/author/Q66317808 .

Yes, cross-linking would be useful.

Will test a bit more later.

@Daniel-Mietchen

This comment has been minimized.

Copy link
Contributor Author

commented Aug 9, 2019

Things to add to work_item.php:

  • published in (with link to associated Scholia missing page)
  • main subject (with link to associated Scholia missing page)
  • perhaps links to author_item.php for citing/ cited papers

Links to work_item.php from pages like

@arthurpsmith

This comment has been minimized.

Copy link
Owner

commented Aug 12, 2019

Ok, "published in" and "main subject" links have been added, with the Scholia links as suggested. Also links from index.php and author_item etc. to the "work_item" page via a [work] link after the title - do you think that's sufficient? Maybe it should be highlighted somehow?

I'm not sure what you mean by links to author_item for citing/cited papers - you mean if we have identified authors on any citing or cited papers to add a list to the work item page? I haven't looked at traversing the citation graph at all here yet...

@Daniel-Mietchen

This comment has been minimized.

Copy link
Contributor Author

commented Aug 12, 2019

I just tried to move the [work] link to the author column, as per #72 .

Some highlighting seems useful — perhaps rethink the color scheme altogether, e.g. why is the journal linked in black, the topic in red?

Regarding the citation graph, what I had in mind here is that people often cite their own work, so if we already have a work with a P50 statement that cites our target work and the target work has a P2093 string that is in the label or aliases of that person's item, then that could be suggested on the page as a potential match. Such citation graph walking is very helpful in cases like your example where the author name strings are provided just with initials.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
2 participants
You can’t perform that action at this time.