feature request: possibility to query about the progress #162

Closed
dportabella opened this issue Jan 21, 2018 · 2 comments

@dportabella
Contributor

commented Jan 21, 2018

Related to #156.
It would be great to be able to query the loading progress, with something such as:

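import java.io.File
import scala.concurrent.duration._ // enables `1.hour`

case class WarcFileReport(file: File, started: Boolean, finished: Boolean, completed: Double)

def progressFn(warcFileReports: List[WarcFileReport]): Unit = {
  warcFileReports.foreach(println)
}

// Proposed signature: `progressFnEvery` and `progressFn` are the requested additions to loadArchives.
RecordLoader.loadArchives(file, sc, progressFnEvery = 1.hour, progressFn = progressFn)
  .foreach(archiveRecord => println(s"+++ ${archiveRecord.getUrl}"))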

Is something like that feasible?

@ianmilligan1

Member

commented Jan 21, 2018

Thanks @dportabella - we'll investigate.

@ruebot

Member

commented Jul 16, 2019

I believe this would be better resolved by just using the Spark web interface. It gives you detailed per-job and per-stage progress, and recent versions of Spark also show decent status indicators in the shell interface.

If it isn't, please feel free to re-open the issue.
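
For anyone who wants the same numbers programmatically rather than through the web interface, here is a minimal sketch that polls Spark's built-in `SparkContext.statusTracker`. The `StageProgress` object and its `fraction` and `report` helpers are hypothetical names used only for illustration, and the output is task-level progress per active stage, not the per-WARC-file report requested above.

```scala
import org.apache.spark.SparkContext

// Hypothetical helper, not part of this project or of Spark itself.
object StageProgress {
  // Fraction of finished tasks in [0.0, 1.0]; 0.0 when the stage has no tasks.
  def fraction(completed: Int, total: Int): Double =
    if (total <= 0) 0.0 else completed.toDouble / total

  // Print one line per currently active stage using Spark's status tracker.
  def report(sc: SparkContext): Unit = {
    val tracker = sc.statusTracker
    for {
      stageId <- tracker.getActiveStageIds()
      info    <- tracker.getStageInfo(stageId) // None if the stage info is no longer available
    } {
      val pct = fraction(info.numCompletedTasks(), info.numTasks()) * 100
      println(f"stage ${info.stageId()}%d (${info.name()}%s): $pct%.1f%% of tasks done")
    }
  }
}
```

Calling `StageProgress.report(sc)` from a separate thread, or from the shell while a job is running, prints the active stages at that moment. It prints nothing when no stage is active.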

Screenshot from 2019-07-16 19-47-05
Screenshot from 2019-07-16 19-46-01

@ruebot ruebot closed this Jul 16, 2019
