Skip to content
Permalink
Tree: aaaa193945
Commits on May 21, 2020
  1. Update puma to version 3.12.6 (#398)

    depfu committed May 21, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on May 19, 2020
  1. Update all of rails to version 5.2.4.3 (#397)

    depfu committed May 19, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on May 12, 2020
  1. Update figaro to version 1.2.0 (#396)

    depfu committed May 12, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on May 9, 2020
  1. Update eslint to version 7.0.0 (#395)

    depfu committed May 9, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
  2. Update jquery-rails to version 4.4.0 (#394)

    depfu committed May 9, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Apr 30, 2020
  1. Get date finished from domains file.

    ruebot committed Apr 30, 2020
  2. Get date finished from filtered_text directory.

    ruebot committed Apr 30, 2020
  3. Add crawl date frequency vizualization, and parallelize graphpass job.

    ruebot committed Apr 30, 2020
    - Add sub-job to textfilter job to create crawl frequency count data
    - Add helper method to prepare data for visualization
    - Update controller and view to show visualization
    - Add route for crawl date visualization data
    - Update Graphpass job to run two parallelized sub-jobs
      - Graphpass + combining part files
      - Removing output directories
Commits on Apr 28, 2020
  1. Start logging before the jobs are executed.

    ruebot committed Apr 28, 2020
  2. Make sure we only download arc/warc files, not wats or wanes.

    ruebot committed Apr 28, 2020
    - Update Spark job to run on directory, since wildcard will break things
    on directories with MANY files.
  3. Update Spark job to run auk jobs via spark-submit in parallel. (#393)

    ruebot committed Apr 28, 2020
    * Update Spark job to run auk jobs via spark-submit in parallel.
    
    - Update Rubocop config
    - Remove AUK Notebooks link
    - Update application config example
    - Tweak analyzed date helper to use filtered text to get date (last item
    that is ran in the pipeline)
    - Change name of "Full Text" derivative to "Web Page Text" since full
    text is misleading
    - Multiply data analyzed total by 3 since that's what we're doing in
    reality
Commits on Apr 24, 2020
  1. Update byebug to version 11.1.3 (#392)

    depfu committed Apr 24, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Apr 18, 2020
  1. Update byebug to version 11.1.2 (#390)

    depfu committed Apr 18, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Apr 17, 2020
  1. We should remove www prefixes on the domain count job.

    ruebot committed Apr 17, 2020
  2. We should remove www prefixes on the domain count job.

    ruebot committed Apr 17, 2020
Commits on Apr 16, 2020
  1. Another language tweak for the download tooltips.

    ruebot committed Apr 16, 2020
  2. tweak toolkit language

    ruebot committed Apr 16, 2020
  3. [ImgBot] Optimize images (#389)

    imgbot and ImgBotApp committed Apr 16, 2020
    /app/assets/images/Tutorial_domain_derivative_file.png -- 168.85kb -> 149.15kb (11.66%)
    
    Signed-off-by: ImgBotApp <ImgBotHelp@gmail.com>
    
    Co-authored-by: ImgBotApp <ImgBotHelp@gmail.com>
  4. Minor tweaks to domain page (follow up on #387) (#388)

    ianmilligan1 committed Apr 16, 2020
    * Minor tweaks to domain page (follow up on #387)
    * Suggest using "Import"; text -> CSV
    * Updating image
  5. Update jobs to use aut-0.60.0. (#387)

    ruebot committed Apr 16, 2020
    - Resolves #386
    - Move the faux txt derivatives to what they actually are; csv.
    - Update Spark job to use DataFrames
    - Update auk documentation and lessons with correct file extension
    (s/txt/csv)
    - Data migration needs to be completed on prod
      - rename full-text and full-domains
        - s/.txt/.csv/g
      - on all -fullurls.txt
        - remove the first and last character on each line. ( )
    - TravisCI should only test Ruby 2.6.5
    - Update tests to reflect changes
    - Rename text fixtures
Commits on Apr 14, 2020
  1. Tooooooooooooooooo many escapes.

    ruebot committed Apr 14, 2020
Commits on Apr 6, 2020
  1. Update loofah to version 2.5.0 (#385)

    depfu committed Apr 6, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
  2. Update aut version in use.

    ruebot committed Apr 6, 2020
Commits on Apr 1, 2020
  1. Update os to version 1.1.0 (#384)

    depfu committed Apr 1, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Mar 30, 2020
  1. Update http to version 4.4.1 (#383)

    depfu committed Mar 30, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Mar 26, 2020
  1. Update http to version 4.4.0 (#382)

    depfu committed Mar 26, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Mar 19, 2020
  1. Update all of rails to version 5.2.4.2 (#381)

    depfu committed Mar 19, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Mar 18, 2020
  1. Update sys-filesystem to version 1.3.4 (#380)

    depfu committed Mar 18, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Mar 15, 2020
  1. Update rubyzip to version 2.3.0 (#379)

    depfu committed Mar 15, 2020
    Co-authored-by: depfu[bot] <23717796+depfu[bot]@users.noreply.github.com>
Commits on Feb 29, 2020
  1. Update puma to version 3.12.4 (#378)

    depfu committed Feb 29, 2020
Commits on Feb 26, 2020
  1. Update simplecov to version 0.18.5 (#377)

    depfu committed Feb 26, 2020
Commits on Feb 24, 2020
  1. Update simplecov to version 0.18.3 (#376)

    depfu committed Feb 24, 2020
Commits on Feb 21, 2020
  1. Remove tzinfo-data: tzinfo/tzinfo-data#12 (comment)

    ruebot committed Feb 21, 2020
Commits on Feb 20, 2020
  1. Update groupdate to version 5.0.0 (#375)

    depfu committed Feb 20, 2020
Older
You can’t perform that action at this time.