Join GitHub today
GitHub is home to over 40 million developers working together to host and review code, manage projects, and build software together.
Sign upAdding "Getting Started" page to the website, resolves #120 #121
Conversation
- An overview; selected screenshots for each tool.
Overall good to go. Just some language tweaks, and good on my end. |
|
||
| Tool | Skill | What it Does | What You Need | Ideal For | | ||
|:--------------------------:|--------------|--------------|---------------|-----------| | ||
| ![logo](/images/cloud-logo.png) | Beginner | The **[Archives Unleashed Cloud](/cloud)** is a web-based GUI front end for working with [**Archive-It**](https://archive-it.org) collections. Drawing on your Archive-It credentials, you can sync your collections, run basic analyses, and generate a standardized set of research derivatives: full text, network diagrams, and basic statistics on your collection. | This does not require technical skills. However, you need an **Archive-It** account. You can get this if you are an Archive-It subscriber, **or** if you connect with a librarian responsible for a collection they can generate you a guest account. | Librarians, and researchers who know a librarian with an Archive-It account! | |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
ruebot
May 3, 2019
Member
"standardized set of research derivatives: full text, network diagrams, and basic statistics on your collection" -- link that to derivatives documentation?
| Tool | Skill | What it Does | What You Need | Ideal For | | ||
|:--------------------------:|--------------|--------------|---------------|-----------| | ||
| ![logo](/images/cloud-logo.png) | Beginner | The **[Archives Unleashed Cloud](/cloud)** is a web-based GUI front end for working with [**Archive-It**](https://archive-it.org) collections. Drawing on your Archive-It credentials, you can sync your collections, run basic analyses, and generate a standardized set of research derivatives: full text, network diagrams, and basic statistics on your collection. | This does not require technical skills. However, you need an **Archive-It** account. You can get this if you are an Archive-It subscriber, **or** if you connect with a librarian responsible for a collection they can generate you a guest account. | Librarians, and researchers who know a librarian with an Archive-It account! | | ||
| ![logo](/images/notebook-logo.png) | Beginner/Intermediate | The **[Archives Unleashed Notebooks](/notebooks)** are web-based Jupyter Notebooks that can help you work with the output of the Archives Unleashed Cloud. While they are a bit difficult to install, once you have them up and running you can use your web browser to work through interactive tutorials! Explore your data through rich visualizations! | You need to install the "dependencies" for the notebooks. While you can follow instructions, it does require running commands in your "command line." This requires an intermediate level of technical knowledge. We [recommend this tutorial](https://programminghistorian.org/en/lessons/intro-to-bash). | Researchers who want to explore their web archival collections. | |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
I guess with such positive feedback I will move out of review. We can always add the video later? |
|:--------------------------:|--------------|--------------|---------------|-----------| | ||
| ![logo](/images/cloud-logo.png) | Beginner | The **[Archives Unleashed Cloud](/cloud)** is a web-based platform for working with [**Archive-It**](https://archive-it.org) collections. Drawing on your Archive-It credentials, you can sync your collections, run basic analyses, and generate a [standardized set of research derivatives: full text, network diagrams, and basic statistics on your collection](https://cloud.archivesunleashed.org/derivatives). | This does not require technical skills. However, you need an **Archive-It** account. You can get this if you are an Archive-It subscriber, **or** if you connect with a librarian responsible for a collection they can generate you a guest account. | Librarians, and researchers who know a librarian with an Archive-It account! | | ||
| ![logo](/images/notebook-logo.png) | Beginner/Intermediate | The **[Archives Unleashed Notebooks](/notebooks)** are Jupyter Notebooks that can help you work with the output of the Archives Unleashed Cloud. Once you have them up and running you can use your web browser to work through interactive tutorials! Explore your data through rich visualizations! | You need to install the "dependencies" for the notebooks. While you can follow instructions, it does require running commands in your "command line." This requires an intermediate level of technical knowledge. We [recommend this tutorial](https://programminghistorian.org/en/lessons/intro-to-bash). | Researchers who want to explore their web archival collections. | | ||
| ![logo](/images/warclight-logo.png) | Advanced | **[Warclight](/warclight)** is a search engine that lets users discover web archives. Think of it like the library catalogue meeting the WARC file! While it is easy to use, setting it up on your own collections requires an advanced level of knowledge. | You need a lot of WARCs that would benefit from this search engine. If you don't know what WARCs are, this is not the tool for you! | Librarians and archivists who have been collecting web archives and who want to enhance their discoverability. | |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
SamFritz
May 8, 2019
Member
Just wondering if it would be useful to help ball park "a lot of WARCs"?
This comment has been minimized.
This comment has been minimized.
SamFritz
May 8, 2019
Member
Suggestion:
Librarians and archivists who have been collecting web archives and who want to enhance their discoverability. -->
Librarians and archivists who have been collecting web archives and want to enhance collection discoverability. -->
This comment has been minimized.
This comment has been minimized.
ianmilligan1
May 8, 2019
Author
Member
Great, thanks @SamFritz! After reading this I think "a lot of WARCs" is misleading.. it really is just having WARCs, whether it's one or many.
| ![logo](/images/cloud-logo.png) | Beginner | The **[Archives Unleashed Cloud](/cloud)** is a web-based platform for working with [**Archive-It**](https://archive-it.org) collections. Drawing on your Archive-It credentials, you can sync your collections, run basic analyses, and generate a [standardized set of research derivatives: full text, network diagrams, and basic statistics on your collection](https://cloud.archivesunleashed.org/derivatives). | This does not require technical skills. However, you need an **Archive-It** account. You can get this if you are an Archive-It subscriber, **or** if you connect with a librarian responsible for a collection they can generate you a guest account. | Librarians, and researchers who know a librarian with an Archive-It account! | | ||
| ![logo](/images/notebook-logo.png) | Beginner/Intermediate | The **[Archives Unleashed Notebooks](/notebooks)** are Jupyter Notebooks that can help you work with the output of the Archives Unleashed Cloud. Once you have them up and running you can use your web browser to work through interactive tutorials! Explore your data through rich visualizations! | You need to install the "dependencies" for the notebooks. While you can follow instructions, it does require running commands in your "command line." This requires an intermediate level of technical knowledge. We [recommend this tutorial](https://programminghistorian.org/en/lessons/intro-to-bash). | Researchers who want to explore their web archival collections. | | ||
| ![logo](/images/warclight-logo.png) | Advanced | **[Warclight](/warclight)** is a search engine that lets users discover web archives. Think of it like the library catalogue meeting the WARC file! While it is easy to use, setting it up on your own collections requires an advanced level of knowledge. | You need a lot of WARCs that would benefit from this search engine. If you don't know what WARCs are, this is not the tool for you! | Librarians and archivists who have been collecting web archives and who want to enhance their discoverability. | | ||
| ![logo](/images/toolkit-logo.png) | Advanced | The **[Archives Unleashed Toolkit](/toolkit)** is an Apache Spark-based platform for analyzing web archives at scale. When you use the Archives Unleashed Cloud, you are using the Toolkit in the back end! As you can see from the documentation page, the Toolkit is very powerful. However, it is an advanced tool that requires a high-level of technical knowledge to use --- or at least, patience and effort. We do have a **hands-on walkthrough** [here](/aut/lesson) | You would need a lot of WARCs that would benefit from computationally exploring them at scale. | Researchers who want to explore their WARCs at scale, and who need more flexibility than the Cloud provides. | |
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
SamFritz
May 8, 2019
Member
suggestion:
Researchers who want to explore their WARCs at scale, and who need more flexibility than the Cloud provides. -->
Researchers who want to explore their WARCs at scale and need more flexibility than the Cloud provides.
| Tool | Skill | What it Does | What You Need | Ideal For | | ||
|:--------------------------:|--------------|--------------|---------------|-----------| | ||
| ![logo](/images/cloud-logo.png) | Beginner | The **[Archives Unleashed Cloud](/cloud)** is a web-based platform for working with [**Archive-It**](https://archive-it.org) collections. Drawing on your Archive-It credentials, you can sync your collections, run basic analyses, and generate a [standardized set of research derivatives: full text, network diagrams, and basic statistics on your collection](https://cloud.archivesunleashed.org/derivatives). | This does not require technical skills. However, you need an **Archive-It** account. You can get this if you are an Archive-It subscriber, **or** if you connect with a librarian responsible for a collection they can generate you a guest account. | Librarians, and researchers who know a librarian with an Archive-It account! | | ||
| ![logo](/images/notebook-logo.png) | Beginner/Intermediate | The **[Archives Unleashed Notebooks](/notebooks)** are Jupyter Notebooks that can help you work with the output of the Archives Unleashed Cloud. Once you have them up and running you can use your web browser to work through interactive tutorials! Explore your data through rich visualizations! | You need to install the "dependencies" for the notebooks. While you can follow instructions, it does require running commands in your "command line." This requires an intermediate level of technical knowledge. We [recommend this tutorial](https://programminghistorian.org/en/lessons/intro-to-bash). | Researchers who want to explore their web archival collections. | |
This comment has been minimized.
This comment has been minimized.
SamFritz
May 8, 2019
Member
suggestion:
"This requires an intermediate....." --> An intermediate level of technical knowledge is recommended. We suggest this tutorial
| ![logo](/images/warclight-logo.png) | Advanced | **[Warclight](/warclight)** is a search engine that lets users discover web archives. Think of it like the library catalogue meeting the WARC file! While it is easy to use, setting it up on your own collections requires an advanced level of knowledge. | You need a lot of WARCs that would benefit from this search engine. If you don't know what WARCs are, this is not the tool for you! | Librarians and archivists who have been collecting web archives and who want to enhance their discoverability. | | ||
| ![logo](/images/toolkit-logo.png) | Advanced | The **[Archives Unleashed Toolkit](/toolkit)** is an Apache Spark-based platform for analyzing web archives at scale. When you use the Archives Unleashed Cloud, you are using the Toolkit in the back end! As you can see from the documentation page, the Toolkit is very powerful. However, it is an advanced tool that requires a high-level of technical knowledge to use --- or at least, patience and effort. We do have a **hands-on walkthrough** [here](/aut/lesson) | You would need a lot of WARCs that would benefit from computationally exploring them at scale. | Researchers who want to explore their WARCs at scale, and who need more flexibility than the Cloud provides. | | ||
|
||
## Tools in Action: The Cloud |
This comment has been minimized.
This comment has been minimized.
Getting started page looks great @ianmilligan1. Just added in a few suggestions for wording, please feel free to use any that you think are beneficial :) |
ianmilligan1 commentedMay 3, 2019
As noted in #120, our website is a bit tough to get into. So here's a "getting started" page that basically provides a basic introduction to what our project is about. Right now, in this version, it adds:
We probably want to workshop specific language, so we can do so on this draft PR.
Example Screenshots of the Page