Skip to content

Instantly share code, notes, and snippets.

@manashmandal
Created October 6, 2016 13:38
Show Gist options
  • Save manashmandal/307647465b4b1512d9d4ecd0eccc5dac to your computer and use it in GitHub Desktop.
Save manashmandal/307647465b4b1512d9d4ecd0eccc5dac to your computer and use it in GitHub Desktop.
StanfordNERTagger on linux
from nltk.tag import StanfordNERTagger
import os
print os.environ.get('STANFORD_MODELS')
# Output:
#/home/jackspicer/StanfordParser/stanford-ner-2015-12-09/classifiers/english.all.3class.distsim.crf.ser.gz:/home/jackspicer/stanford-postagger-2015-12-09/models/english.conll.4class.distsim.crf.ser.gz:/home/jackspicer/StanfordParser/stanford-ner-2015-12-09/english.muc.7class.distsim.crf.ser.gz
# I want to select the first classifier model
stanford_classifier = os.environ.get('STANFORD_MODELS').split(':')[0]
# For getting the path for StanfordNERTagger
stanford_ner_path = os.environ.get('CLASSPATH').split(':')[0]
st = StanfordNERTagger(stanford_classifier, stanford_ner_path, encoding='utf-8')
print st.tag("Manash is awesome!".split())
# Output
# [(u'Manash', u'O'), (u'is', u'O'), (u'awesome!', u'O')]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment