Skip to content

Instantly share code, notes, and snippets.

View dice89's full-sized avatar

Alexander Mueller dice89

View GitHub Profile
@dice89
dice89 / gist:2c313bd5cfff0a4fb599
Created February 8, 2015 21:08
Word2Vec Usage from Java with Apache Spark
Word2VecModel model_stemmed = ModelUtil.loadWord2VecModel("/Users/mueller/Coding/Word2Vectors/webbase10p/model_word2vec_stemmed.ser");
Word2VecModel model_unstemmed = ModelUtil.loadWord2VecModel("/Users/mueller/Coding/Word2Vectors/webbase10p/model_word2vec.ser");
System.out.println("Stemmed example");
System.out.println("#############################################");
String term1= "scholar";
String term2 ="student";
//To Stem terms the Porter Stemmer from Apache Lucene is used
double result = Word2VecSim.cousineSimilarityBetweenTerms(model_stemmed,ModelUtil.porter_stem(term1),ModelUtil.porter_stem(term2));