@vargheseraphy
Last active August 29, 2015 14:16
Machine learning and artificial intelligence

6. Learning to Classify Text

Detecting patterns is a central part of Natural Language Processing. Words ending in "-ed" tend to be past tense verbs (chapter 5). Frequent use of "will" is indicative of news text (chapter 3). These observable patterns (word structure and word frequency) happen to correlate with particular aspects of meaning, such as tense and topic. But how did we know where to start looking, and which aspects of form to associate with which aspects of meaning?

The goal of this chapter is to answer the following questions:

How can we identify particular features of language data that are salient for classifying it?

How can we construct models of language that can be used to perform language processing tasks automatically?

What can we learn about language from these models?
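To make "features" and "models" a little more concrete, here is a minimal sketch in plain Python. It is not the NLTK API and not the chapter's own code. The function names (`suffix_features`, `train`, `classify`), the labels `PAST` and `OTHER`, and the toy word list are all assumptions made up for illustration. The idea is only to show a surface feature (does the word end in "-ed"?) being counted against a label and then used to guess the label of an unseen word.

```python
from collections import Counter, defaultdict


def suffix_features(word):
    """Describe a word by its surface form only (hypothetical feature set)."""
    w = word.lower()
    return {"ends_with_ed": w.endswith("ed"), "last_two": w[-2:]}


def train(labeled_words):
    """Count how often each (feature, value) pair occurs with each label."""
    counts = defaultdict(Counter)
    for word, label in labeled_words:
        for name, value in suffix_features(word).items():
            counts[(name, value)][label] += 1
    return counts


def classify(counts, word, default="OTHER"):
    """Add up label votes over the word's features; return the top label."""
    votes = Counter()
    for name, value in suffix_features(word).items():
        # .get avoids creating new entries in the defaultdict for unseen features
        votes.update(counts.get((name, value), {}))
    return votes.most_common(1)[0][0] if votes else default


# Made-up toy training data, for illustration only.
TRAIN = [
    ("walked", "PAST"), ("jumped", "PAST"), ("played", "PAST"),
    ("walk", "OTHER"), ("jumps", "OTHER"), ("table", "OTHER"),
    ("running", "OTHER"),
]

model = train(TRAIN)
print(classify(model, "talked"))
print(classify(model, "talk"))
```

A vote count like this is much cruder than the classifiers the chapter covers, and a suffix feature alone will mislabel words such as "bed". It does show the basic loop, though: pick features, count them against labels, then inspect the counts to see which aspects of form go with which aspects of meaning.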

http://www.nltk.org/book/ch06.html

Collective Intelligence: a book based on the Python language

http://books.google.ae/books?id=fEsZ3Ey-Hq4C&printsec=frontcover&hl=ar&source=gbs_ge_summary_r#v=onepage&q&f=false
