bsodhi / train_and_find_most_similar.py

Last active July 6, 2019 08:11

Using gensim Doc2Vec to find the top N documents in a pre-trained model that are most similar to a given out-of-training corpus.

	""" Training a Doc2Vec model and finding top N documents from pre-trained model which are most similar to a given out-of-training corpus.

	The following sources were the primary references for
	writing this module.

	* https://radimrehurek.com/gensim/models/doc2vec.html
	* https://github.com/RaRe-Technologies/gensim/blob/develop/docs/notebooks/doc2vec-lee.ipynb
	* https://radimrehurek.com/gensim/models/doc2vec.html#usage-examples

bsodhi / countries.json

Created November 28, 2018 08:41 — forked from keeguon/countries.json

A list of countries in JSON