Handle old pages of BBC #1371

sonali0901 · Jul 16, 2017

Fixes #1364
@mvolz @owcz I need some ideas for the title of older pages. The title provided by metadata is not accurate and if I extract it through ZU.xpathText(doc, '//meta[@name="Headline"]/@content') then it would affect results of other pages. Any workaround? Refer to last test case to see the issue.

AFAIS you can distinguish these two cases quite easily by analyzing the url. Thus, I suggest to use some conditional code, i.e. something like

if (url.substr(-4)==".stm") {
   //only for old pages of BBC
   item.title = ZU.xpathText(doc, '//meta[@name="Headline"]/@content');
   item.section = ZU.xpathText(doc, '//meta[@name="Section"]/@content');
}

zuphilip · Jul 16, 2017

AFAIS you can distinguish these two cases quite easily by analyzing the url. Thus, I suggest to use some conditional code, i.e. something like

if (url.substr(-4)==".stm") {
   //only for old pages of BBC
   item.title = ZU.xpathText(doc, '//meta[@name="Headline"]/@content');
   item.section = ZU.xpathText(doc, '//meta[@name="Section"]/@content');
}

What @zuphilip says, though looking at both this and the detectWeb, I think we don't want to require the .stm is at the end of the URL. E.g. some link shorteners add something like ?utm_campaing=mycampaignname to the end of URLs and there's really no reason we should have detectWeb (and then this) break in those cases. I think the safest would be to (again both here and in detect) clean the URL by doing url.replace(/[\?#].+/, "")

adam3smith · Jul 16, 2017

What @zuphilip says, though looking at both this and the detectWeb, I think we don't want to require the .stm is at the end of the URL. E.g. some link shorteners add something like ?utm_campaing=mycampaignname to the end of URLs and there's really no reason we should have detectWeb (and then this) break in those cases. I think the safest would be to (again both here and in detect) clean the URL by doing url.replace(/[\?#].+/, "")

Handle old pages of BBC
Fixes #1364

c2de5fd

Fix title for old BBC pages

42da596

sonali0901 changed the title from WIP : Handle old pages of BBC to Handle old pages of BBC Jul 18, 2017

adam3smith merged commit 701f8c5 into zotero:master Jul 22, 2017
1 check passed

1 check passed

continuous-integration/travis-ci/pr The Travis CI build passed
Details

sonali0901 deleted the sonali0901:BBC branch Jul 23, 2017

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

Handle old pages of BBC (#1371)
Fixes #1364

0bf282c

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

Handle old pages of BBC (#1371)
Fixes #1364

295530e

zotero/translators

Handle old pages of BBC #1371

sonali0901 commented Jul 16, 2017

This comment has been minimized.

zuphilip commented Jul 16, 2017

This comment has been minimized.

adam3smith commented Jul 16, 2017 •

edited

Edited 1 time

adam3smith edited Jul 16, 2017 (most recent)

sonali0901 changed the title from WIP : Handle old pages of BBC to Handle old pages of BBC Jul 18, 2017

adam3smith merged commit `701f8c5` into zotero:master Jul 22, 2017
1 check passed

1 check passed

sonali0901 deleted the sonali0901:BBC branch Jul 23, 2017

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

zotero/translators

Join GitHub today

Handle old pages of BBC #1371

Conversation

sonali0901 commented Jul 16, 2017

This comment has been minimized.

zuphilip Jul 16, 2017

zuphilip commented Jul 16, 2017

This comment has been minimized.

adam3smith Jul 16, 2017

adam3smith commented Jul 16, 2017 • edited Edited 1 time adam3smith edited Jul 16, 2017 (most recent)

sonali0901 changed the title from WIP : Handle old pages of BBC to Handle old pages of BBC Jul 18, 2017

Hide details View details adam3smith merged commit 701f8c5 into zotero:master Jul 22, 2017 1 check passed

1 check passed

sonali0901 deleted the sonali0901:BBC branch Jul 23, 2017

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

zuphilip added a commit to zuphilip/translators that referenced this pull request Mar 28, 2018

adam3smith commented Jul 16, 2017 •

edited

Edited 1 time

adam3smith edited Jul 16, 2017 (most recent)

adam3smith merged commit `701f8c5` into zotero:master Jul 22, 2017
1 check passed