Create Datathon Cheat Sheet #68

Closed
ianmilligan1 opened this Issue Nov 8, 2018 · 2 comments

@ianmilligan1
Member

ianmilligan1 commented Nov 8, 2018

Based on some feedback.

For the next datathon, let's create a cheat sheet of common tasks. In the past, we've kept this in Slack, but it would be better placed on the website too.

Some example topics:

  • how to find your terminal;
  • how to ssh into the server, chmod, etc.
  • how to launch AUT;
  • standard dataset techniques & examples;
  • how to copy files to/from a virtual machine.

ianmilligan1 self-assigned this Nov 8, 2018

@greebie

Collaborator

greebie commented Nov 8, 2018

A few others to consider:

  • accessing SSH via PuTTY;
  • common Spark commands (e.g. map, filter, fold, etc.);
  • a quick explanation of the Spark launch options, such as master(local[*]).
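
For anyone unfamiliar with map, filter, and fold, here is a rough plain-Python sketch of what the three operations do. It is not Spark code: `functools.reduce` stands in for fold, and the `records` data is made up purely for illustration.

```python
from functools import reduce

# Hypothetical sample data: (domain, page count) pairs.
records = [("example.com", 3), ("example.org", 0), ("example.net", 5)]

# map: transform every element (here, keep only the page count).
counts = list(map(lambda r: r[1], records))

# filter: keep only the elements that satisfy a predicate (non-zero counts).
non_zero = list(filter(lambda c: c > 0, counts))

# fold: combine all elements into one value, starting from an initial value.
total = reduce(lambda acc, c: acc + c, non_zero, 0)

print(counts, non_zero, total)
```

The Spark equivalents apply the same ideas to distributed collections instead of in-memory lists.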
@ianmilligan1

Member

ianmilligan1 commented Jan 3, 2019

Decided to tackle this today. I'll stage as a PR for review.

I don't know how to use PuTTY, nor am I going to figure it out without a Windows box (I need to get one at some point, tbh). I don't think common Spark commands belong on this cheat sheet – I put some in from your earlier reference guide and felt it got a bit too technical for the audience we're aiming this particular page at. That said, maybe we could revive your reference guide as a separate page at some point, @greebie.

SamFritz closed this in aa99390 Jan 3, 2019
