New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Discussion: Do we want to implement LGA as a derivative in AUK? #245

Closed
ruebot opened this Issue Jan 10, 2019 · 3 comments

Comments

Projects
None yet
5 participants
@ruebot
Member

ruebot commented Jan 10, 2019

Here is technical overview. Basically, it it is two files; id-map, and id-graph. With these it you build things outlined here on LGA use cases and examples.

Big questions to ask here I believe are:

  • Do our Gephi and Raw Network derivatives cover the same use cases for the LGA? If so, I don't think it is worth implementing.
  • If our Gephi and Raw Network derivatives do not cover the use cases for LGA, is this something we can currently produce out of the box with AUT?
@ianmilligan1

This comment has been minimized.

Member

ianmilligan1 commented Jan 10, 2019

I would ask @greebie to take a closer look, as he's an expert in network analysis, but my sense of this after reviewing the documentation and having worked in the digital humanities side of the house:

  • I think both the Gephi and GraphML derivatives cover the same use cases - you have granular timestamp, as you want, and then it indicates the origin and destination links. The LGA might be slightly more space efficient, but it's also not as readable for spot checking or quickly reviewing data.
  • Part of the project goal is taking WARCs and making them accessible... GEXF and GraphML are standardized file formats that are usable by a wide variety of network projects (including Gephi and SigmaJS), whereas to my knowledge, LGA is only used by IA and Archive-It services. I could be wrong on the latter but they're certainly not as widely used as GEXF and GraphML.

So I think since we cover the use cases, we shouldn't go down this road. But happy to hear other opinions.

@greebie

This comment has been minimized.

Contributor

greebie commented Jan 11, 2019

I remember taking a look at this before. At this stage, until we start seeing the LGA format being used widely by big data analysis players (Gephi in particular), I say it's not a huge priority now.

I can see why it could offer some useful benefits for, say, a D3.js visualization, but I don't see a tonne of work out there playing with derivatives in this way yet.

@ianmilligan1

This comment has been minimized.

Member

ianmilligan1 commented Jan 11, 2019

OK, I think we can probably close this now (removed stupid link).

@ruebot ruebot closed this Jan 11, 2019

@ruebot ruebot added the wontfix label Jan 11, 2019

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment