Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spark 3.0.0 + Java 11 support. #375

Open
wants to merge 51 commits into
base: master
from
Open

Spark 3.0.0 + Java 11 support. #375

wants to merge 51 commits into from

Conversation

@ruebot
Copy link
Member

ruebot commented Nov 10, 2019

GitHub issue(s): #356

What does this Pull Request do?

Mostly pom.xml updates. Lots of cleanup, and updates. Big stuff, update to Apache Spark 3.0.0, and Java 11.

How should this be tested?

  • TravisCI
  • Exhaustive regression testing; basically hit everything in the documentation, and make sure it works.
  • Testing gist (updating it as I go through)

Additional Notes:

  • BIG KICKER
    • I cannot get this to work with --packages. We hit an ugly dependency wall really quick. If we exclude, all the tests fail really bad. If I explicitly include it as a dependency, same thing. Tests fail really bad.
  • I'm going to leave this as a draft, and we shouldn't merge until there is an official Spark 3.0.0 release, and we make a decision on --packages.
  • We'll squash this all down, and make a nice commit message when the time comes.
ruebot added 5 commits Aug 31, 2019
- Some hacks to get a sucessful build
- Definitely need to loop back and clean-up a whole lot!
- Addresses #356
…talled 🤦, and a bunch more pom cleanup.
@ruebot ruebot requested review from lintool and ianmilligan1 Nov 10, 2019
ruebot added 4 commits Nov 10, 2019
@codecov
Copy link

codecov bot commented Nov 10, 2019

Codecov Report

Merging #375 into master will increase coverage by 4.50%.
The diff coverage is n/a.

@@             Coverage Diff              @@
##             master     #375      +/-   ##
============================================
+ Coverage     83.69%   88.20%   +4.50%     
- Complexity        0       57      +57     
============================================
  Files            43       43              
  Lines          1245      958     -287     
  Branches        239       86     -153     
============================================
- Hits           1042      845     -197     
+ Misses           80       74       -6     
+ Partials        123       39      -84     
@lintool
Copy link
Member

lintool commented Nov 10, 2019

This is awesome! Did a quick check, everything looks sane to me.

ruebot added 3 commits Nov 10, 2019
@ianmilligan1
Copy link
Member

ianmilligan1 commented Nov 10, 2019

Thanks for this @ruebot! Have built it locally, but will take my time to exhaustively run through the docs before giving it the thumbs up.

ruebot added 2 commits Nov 10, 2019
@ruebot ruebot added this to In Progress in 1.0.0 Release of AUT Nov 14, 2019
ruebot added 4 commits Nov 18, 2019
ruebot added 4 commits Nov 28, 2019
ruebot added 22 commits Jan 13, 2020
…ue-356
…ue-356
@ruebot ruebot marked this pull request as ready for review Jun 17, 2020
@ruebot
Copy link
Member Author

ruebot commented Jun 17, 2020

These all work with this branch, and Spark 3.0.0 (Hadoop 2.7).

@ruebot
Copy link
Member Author

ruebot commented Jun 17, 2020

I didn't test this one, but it is covered in all the others for the most part.

@ruebot
Copy link
Member Author

ruebot commented Jun 17, 2020

I'll try and get some s3 smoke testing done later today or tomorrow.

@ruebot
Copy link
Member Author

ruebot commented Jun 17, 2020

spark-shell smoke test:
Screenshot from 2020-06-17 15-36-19

pyspark smoke test:
Screenshot from 2020-06-17 15-37-04

Copy link
Member

ianmilligan1 left a comment

Looks great! Tested in Spark shell

Screen Shot 2020-06-17 at 5 07 33 PM

and Python 3 notebook:

Screen Shot 2020-06-17 at 5 08 25 PM

I didn't test every single thing in the documentation, but did get broad representation across all the main commands and the most important functions that we run.

👏 👏 👏 @ruebot, this is a real achievement!

@SamFritz
Copy link
Member

SamFritz commented Jun 17, 2020

Congratulations @ruebot 👏 👏! So much work has gone into this, and so excited to see you push it past the finish line with this wonderful achievement.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
1.0.0 Release of AUT
  
In Progress
Linked issues

Successfully merging this pull request may close these issues.

None yet

4 participants
You can’t perform that action at this time.