Skip to content

Instantly share code, notes, and snippets.

@ogrisel
Created September 18, 2011 13:39
Show Gist options
  • Save ogrisel/1225080 to your computer and use it in GitHub Desktop.
Save ogrisel/1225080 to your computer and use it in GitHub Desktop.
pignlproc topics - sample output
$ python categorize.py http://en.wikinews.org/wiki/Denmark_elects_new_centre-left_coalition_and_prime_minister
Category:College_of_Europe [0.454]
Category:Elections_in_Denmark [0.447]
Category:Party_of_European_Socialists [0.432]
Category:Politics_of_Denmark [0.425]
Category:Danish_law [0.408]
$ python categorize.py http://en.wikinews.org/wiki/Zimbabwe_minister_warns_media
Category:Zimbabwe [0.247]
Category:Government_of_Zimbabwe [0.235]
Category:Politics_of_Zimbabwe [0.224]
Category:Elections_in_Zimbabwe [0.202]
Category:Zimbabwean_presidential_election,_2008 [0.198]
$ python categorize.py http://en.wikinews.org/wiki/Southampton_City_Council_votes_against_water_fluoridation
Category:Halogens [0.248]
Category:Fluorine [0.227]
Category:Water_treatment [0.220]
Category:Political_controversies [0.188]
Category:Dentistry [0.186]
$ python categorize.py http://www.icml-2011.org/papers.php
Category:Machine_learning [0.496]
Category:Statistical_classification [0.459]
Category:Computational_statistics [0.427]
Category:Bayesian_statistics [0.417]
Category:Multivariate_statistics [0.346]
$ python categorize.py http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0023936
Category:Genes [0.142]
Category:Programmed_cell_death [0.120]
Category:Cancer_treatments [0.109]
Category:Tumor_suppressor_genes [0.106]
Category:Cell_biology [0.093]
@vene
Copy link

vene commented Sep 18, 2011

This looks interesting to me:D Could you provde some context please?

@ogrisel
Copy link
Author

ogrisel commented Sep 19, 2011

This is a sample outcome of using Solr MoreLikeThis as a k-NN classifier on a 50k+ topics index using aggregate text from articles in wikipedia / dbpedia: https://github.com/ogrisel/pignlproc/tree/master/examples/topic-corpus

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment