@gaphex
Created May 9, 2019 14:55
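# Colab form parameter: demo mode keeps only the first 1M lines of the corpus.
DEMO_MODE = True #@param {type:"boolean"}

if DEMO_MODE:
    CORPUS_SIZE = 1000000
else:
    CORPUS_SIZE = 100000000 #@param {type: "integer"}

# Truncate dataset.txt in place to its first CORPUS_SIZE lines.
!head -n $CORPUS_SIZE dataset.txt > subdataset.txt
!mv subdataset.txt dataset.txt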
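
The `!` lines above only work inside a notebook (IPython/Colab). Outside one, the same truncation could be sketched in plain Python roughly as follows. The helper name `truncate_to_first_lines` is hypothetical and not part of the original gist; it assumes the file uses `\n` line endings, as `head` does.

```python
import itertools
import os
import tempfile


def truncate_to_first_lines(path, max_lines):
    """Keep only the first max_lines lines of the file at path (hypothetical helper)."""
    directory = os.path.dirname(os.path.abspath(path))
    # Copy the leading lines into a temporary file in the same directory,
    # then swap it over the original, mirroring the head + mv pair.
    with open(path, "rb") as src, tempfile.NamedTemporaryFile(
        "wb", dir=directory, delete=False
    ) as dst:
        dst.writelines(itertools.islice(src, max_lines))
        tmp_name = dst.name
    os.replace(tmp_name, path)
```

With the variables from the cell above, this would be called as `truncate_to_first_lines("dataset.txt", CORPUS_SIZE)`. The file is read lazily, so only the lines that are kept are ever processed.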