@language-engineering
Created September 17, 2012 10:17
from sussex_nltk.corpus_readers import RCV1CorpusReader  # import the corpus reader

rcv1cr = RCV1CorpusReader()  # create a new Reuters (RCV1) corpus reader
tokens = rcv1cr.words()  # get all tokens in the corpus
for token in tokens:
    print(token)  # print each token in the corpus