Skip to content
Permalink
Browse files

[skip travis] Add pre-print link to README. (#423)

* [skip travis] Add pre-print link to README.
  • Loading branch information
ruebot committed Feb 11, 2020
1 parent 8f1a9f1 commit 9474a0996e028b4859c2a40dd2a02a77e3b7a8ea
Showing with 2 additions and 1 deletion.
  1. +2 −1 README.md
@@ -9,9 +9,10 @@

The Archives Unleashed Toolkit is an open-source platform for analyzing web archives built on [Apache Spark](http://spark.apache.org/), which provides powerful tools for analytics and data processing. This toolkit is part of the [Archives Unleashed Project](http://archivesunleashed.org/).

The toolkit grew out of a previous project called [Warcbase](https://github.com/lintool/warcbase). The following article provides a nice overview, much of which is still relevant:
The following two articles give an overview of the project:

+ Jimmy Lin, Ian Milligan, Jeremy Wiebe, and Alice Zhou. [Warcbase: Scalable Analytics Infrastructure for Exploring Web Archives](https://dl.acm.org/authorize.cfm?key=N46731). _ACM Journal on Computing and Cultural Heritage_, 10(4), Article 22, 2017.
+ Nick Ruest, Jimmy Lin, Ian Milligan, Samantha Fritz. [The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives](https://arxiv.org/abs/2001.05399). 2020.

## Dependencies

0 comments on commit 9474a09

Please sign in to comment.
You can’t perform that action at this time.