add google colab notebook for how to access data #57

Merged
merged 1 commit into archivesunleashed:master on May 27, 2019

@yzhang0
Contributor

commented May 23, 2019

No description provided.

@ruebot

ruebot approved these changes May 27, 2019

@ruebot ruebot requested a review from ianmilligan1 May 27, 2019

@ianmilligan1
Member

left a comment

Works well.

For documentation, I hooked this up to our other notebooks by adding:

!git clone https://github.com/archivesunleashed/auk-notebooks.git
!pip install -r auk-notebooks/requirements.txt
!python -m nltk.downloader punkt vader_lexicon stopwords

and

from google.colab import auth
auth.authenticate_user()

and then customizing

bucket_name = 'auk_data'
file_path = '1234-fulltext.txt'

and then

!gsutil cp gs://{bucket_name}/{file_path} auk-notebooks/data/1234-fulltext.txt

This method worked reasonably well for me. After some copy edits and finessing, we might be able to put some Google Colab elements into the notebooks?
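
As a rough sketch of the copy step above: the destination filename is typed out twice in the snippet, once in `file_path` and once in the `gsutil cp` target, so deriving both from the same variables might save an edit when customizing. The helper name `build_copy_command` and the `dest_dir` default are hypothetical and not part of the notebook; only `bucket_name`, `file_path`, and the `auk-notebooks/data` directory come from the steps above.

```python
import posixpath
import shlex


def build_copy_command(bucket_name, file_path, dest_dir="auk-notebooks/data"):
    """Build the gsutil argument list for copying one object into dest_dir.

    Hypothetical helper for illustration; it only builds the command and
    does not run it.
    """
    # Source object in the bucket, e.g. gs://auk_data/1234-fulltext.txt
    source = f"gs://{bucket_name}/{file_path}"
    # Keep the original filename, dropping any folder prefix inside the bucket.
    destination = posixpath.join(dest_dir, posixpath.basename(file_path))
    return ["gsutil", "cp", source, destination]


bucket_name = "auk_data"
file_path = "1234-fulltext.txt"

command = build_copy_command(bucket_name, file_path)
# Print the command as a single shell-quoted line for inspection.
print(shlex.join(command))
```

In a Colab cell, the printed line could be pasted after a `!`, or the list could be passed to `subprocess.run(command, check=True)` once `auth.authenticate_user()` has been called.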

@ruebot ruebot merged commit 8862bec into archivesunleashed:master May 27, 2019

1 check passed

ci/dockercloud Your tests passed in Docker Cloud