Skip to content

Instantly share code, notes, and snippets.

@computerphysicslab
Created September 8, 2020 12:06
Show Gist options
  • Save computerphysicslab/d1d767822275cfbf3a431bb5251bf276 to your computer and use it in GitHub Desktop.
Save computerphysicslab/d1d767822275cfbf3a431bb5251bf276 to your computer and use it in GitHub Desktop.
"Latent Dirichlet Allocation (LDA) Blei et.al, 2002, proposed LDA model. They described thismodel as a generative probabilistic three-level hierarchical Bayesian model. They say that this model is for large collections of discrete data and tries to find short descriptions for the collection to process large collection of documents. It models the corpus as a collection of documents, each document as a distribution of multiple topics and each topic as a mixture of words."
"rstenjak et.al, 2013, proposed a possibility of using a KNN with TF-IDF method for text classification. The evaluation is based on the speed, accuracy and quality of classification. The results include both good and bad features. The main motivation for this paper was to develop concept frameworks with emphasis on KNN & TF-IDF module."
https://www.digitalxplore.org/up_proc/pdf/268-148653117584-89.pdf
#knn #lda #topicModel #textClassification
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment