Skip to content

Instantly share code, notes, and snippets.

@njahn82
Last active August 29, 2016 13:26
Show Gist options
  • Select an option

  • Save njahn82/d79bdfc42346d0dd911488decd1896c9 to your computer and use it in GitHub Desktop.

Select an option

Save njahn82/d79bdfc42346d0dd911488decd1896c9 to your computer and use it in GitHub Desktop.
3:AM Hack Day porposal

Proposed datasets

Sci-Hub usage

In spring 2016, [Science featured a dataset on global usage of Sci-Hub](http://www.sciencemag.org/news/2016/04/whos-downloading-pirated-papers- everyone), a prominent shadow library for scholarly literature. The dataset, openly available via Dryad, tracks more than 28 Mio Sci-Hub usage events over a period of six month on the article-level. Tab-separated files contain timestamps, geo-locations (latitude, longitude), and the Digital Object Identifiers (DOI) of each requested full text.

Possible questions

  • Are there any datasets available which can be easily combined with the Sci-Hub data to explore its usage? And how can we achieve this?
  • It would be also interesting to re-build or even enhance the interactive information visualisations provided by Science on Sci-Hub usage.

We will provide a local copy of the dataset on USB drives in case you forget to download the dataset before the Hack Day. Smaller sample files, providing snapshots of the data, will be available as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment