toofishes.net

Arch Package Visualization

Nothing like seeing a reddit post, saying you’ll spend a few minutes looking into something, and then realizing you spent multiple hours on it. Today that time sink was Gephi, a pretty cool desktop application for generating graphs from a variety of data sources. It is available in the AUR if you are an Arch Linux user.

I chose a data set I am pretty familiar with, and one for which I could easily generate the input data Gephi needs. Scraping together a quick Django admin command that pulls from the Arch Linux website database seemed like the easiest way to get nodes (packages) and some attributes. The edges are, of course, the dependencies between packages.
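For anyone wanting to try something similar, here is a minimal sketch of the node and edge generation step. It is not the actual admin command: the package data is a hypothetical hard-coded dictionary rather than a Django query, and it assumes Gephi's spreadsheet (CSV) import, which expects `Id`/`Label` columns for nodes and `Source`/`Target` columns for edges. The function names and file names are made up for illustration.

```python
import csv

# Hypothetical sample data: package name -> (repository, dependency names).
# A real script would pull these rows from the website database instead.
PACKAGES = {
    "glibc": ("core", []),
    "readline": ("core", ["glibc"]),
    "bash": ("core", ["glibc", "readline"]),
    "pacman": ("core", ["bash", "glibc"]),
}


def build_tables(packages):
    """Return (nodes, edges) rows suitable for Gephi's CSV import."""
    # One node per package; the repository is carried along as an attribute.
    nodes = [(name, name, repo) for name, (repo, _deps) in sorted(packages.items())]
    # One directed edge per dependency, pointing from the package to what it needs.
    edges = [
        (name, dep)
        for name, (_repo, deps) in sorted(packages.items())
        for dep in deps
        if dep in packages  # skip names that are not known packages
    ]
    return nodes, edges


def write_csv(header, rows, stream):
    """Write a header line followed by the given rows to an open text stream."""
    writer = csv.writer(stream)
    writer.writerow(header)
    writer.writerows(rows)


if __name__ == "__main__":
    nodes, edges = build_tables(PACKAGES)
    with open("nodes.csv", "w", newline="") as fh:
        write_csv(["Id", "Label", "Repo"], nodes, fh)
    with open("edges.csv", "w", newline="") as fh:
        write_csv(["Source", "Target"], edges, fh)
```

Note that this sketch silently drops any dependency that does not match a package name exactly, so anything satisfied only through a differently named provider never becomes an edge.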

The graph here has a few shortcomings, but to me the high-level visualization was more important than getting everything perfectly correct:

Even with these known deficiencies, the graph is still pretty awesome.

Packages visualization

A full packages PDF is available; its page size is very large, so you can zoom in and see individual packages.

I may tweak the data generation scripts a bit more to account for provides (where one package satisfies a dependency on a different name) and similar details, to see if that results in a more informative graph.

Update: Here is the latest Gephi packages file I have. This is using an updated script that handles provides, so things like perl and bash show up much larger than before.
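To show why handling provides changes node sizes, here is a rough sketch of the idea, not the updated script itself. It assumes node size tracks in-degree (how many packages depend on a node), and it maps each dependency name that is not a real package to the first package that provides it. The `app-one`, `app-two`, and `perl-some-module` names are invented sample data, and versioned dependencies are ignored for simplicity.

```python
from collections import Counter

# Hypothetical sample data: package name -> dependency names.
DEPENDS = {
    "glibc": [],
    "bash": ["glibc"],
    "perl": ["glibc"],
    "app-one": ["sh", "perl-some-module"],
    "app-two": ["sh"],
}

# Hypothetical sample data: package name -> names it provides.
PROVIDES = {
    "bash": ["sh"],
    "perl": ["perl-some-module"],
}


def resolve_edges(depends, provides):
    """Return (package, target) edges, following provides where needed."""
    # Map each provided name to the first package seen providing it.
    provider_of = {}
    for pkg, names in provides.items():
        for name in names:
            provider_of.setdefault(name, pkg)

    edges = []
    for pkg, deps in depends.items():
        for dep in deps:
            # Prefer a real package of that name, else fall back to a provider.
            target = dep if dep in depends else provider_of.get(dep)
            if target is not None and target != pkg:
                edges.append((pkg, target))
    return edges


def in_degree(edges):
    """Count how many packages point at each target."""
    return Counter(target for _pkg, target in edges)


if __name__ == "__main__":
    print("without provides:", dict(in_degree(resolve_edges(DEPENDS, {}))))
    print("with provides:   ", dict(in_degree(resolve_edges(DEPENDS, PROVIDES))))
```

In this toy data, `bash` and `perl` have nothing pointing at them until the provided names are resolved, which is the same effect on a much smaller scale.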
