Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Image Report to Score Images From Metadata Higher #8

Closed
shawnmjones opened this issue Jun 16, 2020 · 0 comments
Closed

Update Image Report to Score Images From Metadata Higher #8

shawnmjones opened this issue Jun 16, 2020 · 0 comments
Assignees

Comments

@shawnmjones
Copy link
Member

@shawnmjones shawnmjones commented Jun 16, 2020

The current scores produced by hc report image-data are not as effective as they could be. Humans may have already supplied their desired striking images in the metadata of the web pages making up the collection.

Hypercane's existing image scoring function in hypercane/report/imagedata.py:rank_images currently adds image properties to a list on lines 143 - 152

imageranking.append(
(
score,
pixelsize,
colorcount,
1 / ratio,
noverN,
image_urim
)
)

Add another column to the left containing values of 1 or 0. If Hypercane discovers the image in the metadata, set this column to 1 otherwise 0. This way, when the sorting occurs on line 154, all images discovered in the metadata will exist at the highest ranks in the list and then will be sorted by their MementoEmbed score.

@shawnmjones shawnmjones self-assigned this Jun 16, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
1 participant
You can’t perform that action at this time.