Tree: 4ce59c8594
Commits on Jul 8, 2019
  1. Test Java 8 & 11, and remove OracleJDK; resolves #324. (#325)

    ruebot authored and ianmilligan1 committed Jul 8, 2019
Commits on Jul 5, 2019
  1. Add image analysis and extraction w/TensorFlow (#318)

    h324yang authored and ruebot committed Jul 5, 2019
Commits on Apr 22, 2019
  1. Makes ArchiveRecordImpl serializable by removing non-serializable ARCRecord and WARCRecord variables. Also removes unused headerResponseFormat variable. (#316)

    jrwiebe authored and ruebot committed Apr 22, 2019
Commits on Mar 23, 2019
  1. Resolve cobertura-maven-plugin class issue; resolves #313. (#314)

    ruebot authored and jrwiebe committed Mar 23, 2019
    - Exclude slf4j binding logback-classic
    (mojohaus/cobertura-maven-plugin#6 (comment))
Commits on Mar 18, 2019
Commits on Jan 31, 2019
  1. Log closing of ARC and WARC files, resolves #156 (#301)

    jrwiebe authored and ruebot committed Jan 31, 2019
    * Log opening and closing of archive files as per #156
    * Remove redundant log message. Spark already logs the file that is to be read when an executor computes an RDD.
Commits on Jan 24, 2019
  1. Delete saved image file; resolves #299 (#300)

    jrwiebe authored and ruebot committed Jan 24, 2019
Commits on Nov 28, 2018
  1. Remove Deprecated ExtractGraph app; resolves #291. (#293)

    greebie authored and ruebot committed Nov 28, 2018
    * Remove deprecated ExtractGraph.scala file.
    * Remove deprecated ExtractGraphTest.scala file.
  2. Add .getHttpStatus and .getArchiveFile to ArchiveRecordImpl class #198 & #164 (#292)

    greebie authored and ruebot committed Nov 28, 2018
    
    * Resolves #198
    * Resolves #164
    * Add getHttpStatus to ArchiveRecord class & trait
      - add .getHttpStatus to potential outputs
      - add tests for .getHttpStatus calls
      - improve ArchiveRecord testing overall.
    * Add .getArchiveFile feature to ArchiveRecordImpl.
      - add getArchiveFile to trait
      - add getArchiveFile for ArchiveRecordImpl
      - add tests for getArchiveFile.
    * Other code style fixes.
    * Include updates to tests.
Commits on Nov 22, 2018
  1. Update license headers for #208. (#290)

    ruebot authored and ianmilligan1 committed Nov 22, 2018
  2. Change Id generation for graphs from using hashes for urls to using .zipWithUniqueIds() (#289)

    greebie authored and ruebot committed Nov 22, 2018
    
    * Resolves #243 
    * Create GEXF with proper ids instead of hash to avoid collisions.
    * Add WriteGEXF files.
    * Add WriteGraph file and test.
    * Add test for Graphml output.
    * Add xml escaping for edges.
    * Add test case for non-escaped edges.
    * Add additional tests to cover for more potential cases of graphml and gexf files.
    * Coverage for null cases in urls.
Commits on Oct 19, 2018
  1. CVE-2018-11771 update (#288)

    ruebot authored and ianmilligan1 committed Oct 19, 2018
Commits on Oct 18, 2018
  1. CVE-2017-17485 update; follow-on to #281. (#287)

    ruebot authored and ianmilligan1 committed Oct 18, 2018
Commits on Oct 17, 2018
  1. Update Apache Tika - security vulnerabilities; resolves #131. (#285)

    ruebot authored and ianmilligan1 committed Oct 17, 2018
    - CVE-2018-1338
    - CVE-2018-11762
    - CVE-2018-11761
    - CVE-2016-6809
    - CVE-2018-1339
    - CVE-2018-11796
    - CVE-2016-4434
    - CVE-2018-1335
  2. Only trigger TravisCI on master. (#283)

    ruebot authored and ianmilligan1 committed Oct 17, 2018
  3. [skip travis] Update README (#284)

    ruebot authored and ianmilligan1 committed Oct 17, 2018
  4. Fix bug and unit test for ExtractDomain; resolves #277 (#278)

    borislin authored and ruebot committed Oct 17, 2018
  5. Replace backslash with forward slash in URL; resolves #269 (#276)

    borislin authored and ruebot committed Oct 17, 2018
    * Fix backslash in URL
    * Add backslash test in ExtractDomainTest
  6. Missed something for #208. (#282)

    ruebot authored and ianmilligan1 committed Oct 17, 2018
Commits on Oct 16, 2018
  1. CVE-2018-7489 fix. (#281)

    ruebot authored and ianmilligan1 committed Oct 16, 2018
  2. Update jackson-databind version; resolves #279. (#280)

    ruebot authored and ianmilligan1 committed Oct 16, 2018
Commits on Oct 9, 2018
  1. Clean-up pom.xml to remove plugin warnings; resolves #273. (#274)

    ruebot authored and ianmilligan1 committed Oct 9, 2018
Commits on Oct 4, 2018
  1. [maven-release-plugin] prepare for next development iteration

    ruebot committed Oct 4, 2018
  2. [maven-release-plugin] prepare release aut-0.17.0

    ruebot committed Oct 4, 2018
  3. Fix exception error when processing corrupted ARC files, and empty files. (#272)

    borislin authored and ruebot committed Oct 4, 2018
    
    * Fix exception when processing corrupted ARC files
    * Filter out non-empty archive files in loadArchives()
    * Fix archive files path pattern
    * Resolves #246
    * Resolves #271 
    * Resolves #258
Commits on Sep 10, 2018
  1. Update Bug report template. (#268)

    ruebot authored and ianmilligan1 committed Sep 10, 2018
Commits on Aug 11, 2018
  1. ExtractBoilerpipeText to remove headers as well. #253 (#256)

    greebie authored and ruebot committed Aug 11, 2018
    * ExtractBoilerpipeText now removes headers.
  2. Add additional tweet fields to TweetUtils; partially address #194. (#254)

    ruebot authored and ianmilligan1 committed Aug 11, 2018
    
    - Adds:
      - retweet_count
      - favorite_count
      - in_reply_to_status_id_str
      - in_reply_to_user_id_str
      - in_reply_to_screen_name
      - source
      - user.protected
      - user.profile_image_url
      - user.description
      - user.location
      - user.name
      - user.url
      - user.time_zone
    - Updates some doc comments
    - Updates tests
Commits on Aug 10, 2018
  1. Add support for full_text in tweets; resolve #192. (#252)

    ruebot authored and ianmilligan1 committed Aug 10, 2018
  2. Get rid of 'filesystem-root relative reference' warning. (#251)

    ruebot authored and ianmilligan1 committed Aug 10, 2018
Commits on Aug 9, 2018
  1. Remove stray characters from example commands. (#250)

    ruebot authored and ianmilligan1 committed Aug 9, 2018
  2. Deal with final scalastyle assessments, and Convert nulls to Option(T). (#249)

    greebie authored and ruebot committed Aug 9, 2018
    
    * Fully resolves #196 
    * Resolves #212
Commits on Aug 1, 2018
  1. Address main scalastyle errors - #196 (#248)

    greebie authored and ruebot committed Aug 1, 2018
    * Deal with wildcard import lint issues.
    * Fix some magic numbers & duplicate string runs.
    * Lint fixes, mostly explicit import warnings.
    * All other scalastyle issues require refactoring.
Commits on Jul 29, 2018
  1. Add ExtractGraphX including algorithms for PageRank and Components. Issue 203 (#245)

    greebie authored and ianmilligan1 committed Jul 29, 2018
    
    * pom.xml change for GraphX
    * Changes for GraphXSLS
    * Changes for SLS graph
    * Changes for GraphX
    * Changes for converting WARC RDD to GraphX object
    * Rename extractor to ExtractGraphX
    * Various lint fixes (usually Magic Numbers)
    * Remove illegal imports from scala style (we use wildcard imports a lot)
    * Add WriteGraphXMLTest.
Commits on Jul 27, 2018
  1. Fix TravisCI build issues (#244)

    ruebot authored and ianmilligan1 committed Jul 27, 2018
    * Make the TravisCI build less verbose since we're hitting the 4MB log limit.
    * Pin site.plugin and project-info-reports.plugin so mvn site builds.
      - See:
        - https://stackoverflow.com/questions/51091539/maven-site-plugins-3-3-java-lang-classnotfoundexception-org-apache-maven-doxia
        - https://travis-ci.org/archivesunleashed/aut/jobs/408259462#L3201-L3202