
@calvinalvin
Last active December 27, 2017 19:39

Facebook ML research datasets

Microsoft COCO image sets with image captions

Today, we are open-sourcing a dataset of over 16K crowdsourced queries. More precisely, this dataset contains 2,400 queries for each of the 7 user intents we tested.

~500K English words

One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

International Skin Imaging Collaboration (ISIC)
