In spring 2016, [Science featured a dataset on global usage of Sci-Hub](http://www.sciencemag.org/news/2016/04/whos-downloading-pirated-papers- everyone), a prominent shadow library for scholarly literature. The dataset, openly available via Dryad, tracks more than 28 Mio Sci-Hub usage events over a period of six month on the article-level. Tab-separated files contain timestamps, geo-locations (latitude, longitude), and the Digital Object Identifiers (DOI) of each requested full text.
Possible questions
- Are there any datasets available which can be easily combined with the Sci-Hub data to explore its usage? And how can we achieve this?
- It would be also interesting to re-build or even enhance the interactive information visualisations provided by Science on Sci-Hub usage.
We will provide a local copy of the dataset on USB drives in case you forget to download the dataset before the Hack Day. Smaller sample files, providing snapshots of the data, will be available as well.