Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Align NER output to WANE format #361

Draft
wants to merge 7 commits into
base: master
from

Conversation

@ruebot
Copy link
Member

commented Sep 18, 2019

GitHub issue(s): #297

What does this Pull Request do?

  • I'll
  • Add
  • This when I'm out of draft.

How should this be tested?

  • TravisCI
  • Something like:
import io.archivesunleashed._
import io.archivesunleashed.app._
import io.archivesunleashed.matchbox._

ExtractEntities.extractFromRecords("/home/nruest/Projects/au/aut-resources/NER/english.all.3class.distsim.crf.ser.gz", "/home/nruest/Projects/au/sample-data/geocites/1/GEOCITIES-20091027142649-00105-ia400111.us.archive.org.warc.gz", "/home/nruest/Projects/au/sample-data/issue-297/output-ner/", sc)

Should produce output like this:

Additional Notes:

  • Probably need to update documentation. I'll do a review before this gets merged.
ruebot added 7 commits Jul 22, 2019
@codecov

This comment has been minimized.

Copy link

commented Sep 18, 2019

Codecov Report

Merging #361 into master will increase coverage by 0.16%.
The diff coverage is 23.07%.

@@            Coverage Diff             @@
##           master     #361      +/-   ##
==========================================
+ Coverage   75.93%   76.09%   +0.16%     
==========================================
  Files          39       39              
  Lines        1392     1393       +1     
  Branches      267      268       +1     
==========================================
+ Hits         1057     1060       +3     
+ Misses        218      215       -3     
- Partials      117      118       +1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
1 participant
You can’t perform that action at this time.