Recent Publications

Building Community and Tools for Analyzing Web Archives through Datathons

PDF Project Slides

The Cost of a WARC: Analyzing Web Archives in the Cloud

PDF Project Slides

Recent & Upcoming Talks

More Talks

Lowering the Barrier to Access: The Archives Unleashed Cloud Project
Jun 19, 2019
Project Sustainability and Research Platforms: The Archives Unleashed Cloud Project
Jun 7, 2019
See a little Warclight: building an open-source web archive portal with project blacklight
Jun 6, 2019
Oh, I Get by with a little help from my friends: Interdisciplinary Web Archive Collaboration.
Feb 27, 2019
Make it WALK!
May 10, 2018
Hot Tips To Boost Your Interdisciplinary Web Archive Collaboration!
Apr 17, 2018
The World is a Beautiful and Terrible Place
Mar 22, 2018
Boosting Your Interdisciplinary Web Archive Collaboration
Feb 16, 2018
Twitter and Web Archive Analysis at Scale
Feb 14, 2018

Recent Posts

More Posts

Juxta A couple years ago I wrote about a method for creating a collage out of 1.2M images collected from the 2015 Canadian Federal …

I’ve been collecting tweets to @realDonaldTrump since June 2017. In my most recent time pulling together, and deduping the …

One feature of Blacklight that I’ve always wanted to setup in Warclight is displaying thumbnails in the results display. Getting …

At this past week’s Archives Unleashed dataton, I jokingly created some wordclouds of my Co-PI’s timelines. Finished my …

This is the text for my presention at the “National Forum on Ethics and Archiving the Web”. I had the honour of being on an …

Projects

Archives Unleashed Project

Archives Unleashed aims to make petabytes of historical internet content accessible to scholars and others interested in researching the recent past. Supported by a grant from the Andrew W. Mellon Foundation, we will be developing web archive search and data analysis tools to enable scholars and librarians to access, share, and investigate recent history since the early days of the World Wide Web.

Web Archives for Historical Research

Our research focuses on both web histories - writing about the recent past as reflected in web archives - as well as methodological approaches to understanding these repositories.

Islandora CLAW

Islandora CLAW is the next generation of Islandora.

Fedora Repository

Fedora is the flexible, modular, open source repository platform with native linked data support.

Contact