Tree: 99a4e8b283
-
ruebot committed
Mar 26, 2020 Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
Tweak hasDate to handle Seq. (#430)
Tweak hasDate to handle Seq. - Addresses #425 - Add test for hasDate
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits -
Merge branch 'master' of github.com:archivesunleashed/aut into issue-409
ruebot committedMar 19, 2020 Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
Restyle keep/discard filter UDFs in the context of DataFrames (#429)
Co-authored-by: g285sing <g285sing@student.cs.uwaterloo.ca> (@SinghGursimran) - Resolves #425 - Replace all keep/discard DF udfs with `hasXYZ()` - Update tests
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
Merge branch 'issue-409' of github.com:archivesunleashed/aut into iss…
ruebot committedFeb 20, 2020 …ue-409
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits -
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits -
Update Spark and Hadoop versions. (#426)
- Update Spark to 2.4.5 - Update Hadoop to 2.7.4 (for RADOS/S3 support) - Tweak README
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
Add logic so UDFs that filter on url should also filter on src (#424).
- Resolves #418 - Update tests Co-authored-by: Nick Ruest <ruestn@gmail.com>
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
[skip travis] Add pre-print link to README. (#423)
ruebot committedFeb 11, 2020 * [skip travis] Add pre-print link to README.
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
Add img alt text to imagegraph(); resolves #420. (#422)
- Update ExtractImageLinksRDD to grab alt text - Add alt_text column to imagegraph - Update tests
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits -
Rename imageLinks to imagegraph; resolves #419 (#421)
* Rename imageLinks to imagegraph; resolves #419
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits -
ruebot committed
Feb 10, 2020 Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
Need --repositories flag with --packages. (#417)
- Fully resolves this issue archivesunleashed/docker-aut#19 - archivesunleashed/docker-aut@37ce4e2 - archivesunleashed/docker-aut@082907a - archivesunleashed/docker-aut@baee431
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits -
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits -
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
[maven-release-plugin] prepare release aut-0.50.0
ruebot committedFeb 5, 2020 Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
Start adding filters; keep_valid_pages.
- TODO, make it object oriented
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits -
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits
-
Clean up test descriptions, addresses #372. (#416)
- Clean up test descriptions - Rename typo filename
-
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits -
Add ExtractImageDetailsDF. (#415)
- Add test - Addresses #223
-
- Remove order udfs alphabetically
- Get detect_language setup - ComputeSHA1 and MD5 need some work?
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits -
Merge branch 'issue-409' of github.com:archivesunleashed/aut into iss…
ruebot committedJan 21, 2020 …ue-409
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits -
Setup external lib packaging for Python!!
ruebot committedJan 21, 2020 Add remove_html udf Rename remove_http_header to remove_http_headers
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
g285sing committed
Jan 19, 2020 Loading status checks… -
g285sing committed
Jan 19, 2020 Loading status checks…
-
Verified
This commit was created on GitHub.com and signed with a verified signature using GitHub’s key.GPG key ID: 4AEE18F83AFDEB23 Learn about signing commits -
Add crawl_date to binary DataFrames and imageLinks. (#414)
- Resolves #413 - Update tests where necessary
-
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits -
- Add remove_http_header, remove_prefix_www - Rename extract_domain_func to extract_domain - Formatting updates - Addresses #409
Verified
This commit was signed with a verified signature.ruebot Nick RuestGPG key ID: 417FAF1A0E1080CD Learn about signing commits
-
Various DataFrame implementation updates for documentation clean-up; …
…Addresses #372. - .all() column HttpStatus to http_status_code - Adds archive_filename to .all() - Significant README updates for setup - See also: archivesunleashed/aut-docs#39