Created
September 8, 2020 12:06
-
-
Save computerphysicslab/d1d767822275cfbf3a431bb5251bf276 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
"Latent Dirichlet Allocation (LDA) Blei et.al, 2002, proposed LDA model. They described thismodel as a generative probabilistic three-level hierarchical Bayesian model. They say that this model is for large collections of discrete data and tries to find short descriptions for the collection to process large collection of documents. It models the corpus as a collection of documents, each document as a distribution of multiple topics and each topic as a mixture of words." | |
"rstenjak et.al, 2013, proposed a possibility of using a KNN with TF-IDF method for text classification. The evaluation is based on the speed, accuracy and quality of classification. The results include both good and bad features. The main motivation for this paper was to develop concept frameworks with emphasis on KNN & TF-IDF module." | |
https://www.digitalxplore.org/up_proc/pdf/268-148653117584-89.pdf | |
#knn #lda #topicModel #textClassification |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment