Skip to content

Instantly share code, notes, and snippets.

@ahurriyetoglu
Forked from larsmans/gist:3745866
Created June 21, 2014 16:01
Show Gist options
  • Save ahurriyetoglu/29189bae26bbd0f7a82a to your computer and use it in GitHub Desktop.
Save ahurriyetoglu/29189bae26bbd0f7a82a to your computer and use it in GitHub Desktop.
>>> from pandas import DataFrame
>>> from sklearn.feature_extraction.text import CountVectorizer
>>> docs = ["You can catch more flies with honey than you can with vinegar.",
... "You can lead a horse to water, but you can't make him drink."]
>>> vect = CountVectorizer(min_df=0., max_df=1.0)
>>> X = vect.fit_transform(docs)
>>> print(DataFrame(X.A, columns=vect.get_feature_names()).to_string())
but can catch drink flies him honey horse lead make more than to vinegar water with you
0 0 2 1 0 1 0 1 0 0 0 1 1 0 1 0 2 2
1 1 2 0 1 0 1 0 1 1 1 0 0 1 0 1 0 2
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment