Created
May 15, 2019 10:52
-
-
Save gokulsan/1b9f44dfa7eae4d608d426dbf61302f2 to your computer and use it in GitHub Desktop.
Natural Language Processing based Search Engine Design and Development
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| https://www.searchtechnologies.com/blog/natural-language-processing-techniques | |
| https://stackoverflow.com/questions/46383076/how-to-implement-synonyms-for-use-in-a-search-engine | |
| https://towardsdatascience.com/simple-nlp-search-in-your-application-step-by-step-guide-in-scala-22ca1ce3e475 | |
| https://medium.com/@chrisfotache/text-classification-in-python-pipelines-nlp-nltk-tf-idf-xgboost-and-more-b83451a327e0 | |
| Data Sample | |
| Training NER | |
| Entity Extraction | |
| Categorizing Content | |
| Clustering Content | |
| Fact Extraction | |
| Relationship Extraction | |
| Structure Extraction | |
| Identify and Mark Sentence, phase, and paragraph | |
| Language Identification | |
| Tokenization | |
| Acronym Normalization and Tagging | |
| Lemmatization / Stemming | |
| De-compounding | |
| Effort and Timeline | |
| Resource Matrix | |
| It is a Document Management System | |
| Workflow Management System | |
| Portal Application for the DMS | |
| Search integration to the Portal |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment