Created
October 3, 2018 13:06
-
-
Save djptek/17b67e663eda1dc6561d9926317b08d0 to your computer and use it in GitHub Desktop.
Comparing Dutch Stemmers
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# take a look at the difference between these 2 examples, you can cut and paste them into Kibana | |
# Standard Dutch | |
POST _analyze | |
{ | |
"tokenizer": "standard", | |
"filter": [ | |
"standard", | |
"lowercase", | |
{ | |
"type": "stemmer", | |
"name": "dutch" | |
} | |
], | |
"text": "activeren, activeer, activeert, activeerden, activeerde, geactiveerd" | |
} | |
#Dutch_kp | |
POST _analyze | |
{ | |
"tokenizer": "standard", | |
"filter": [ | |
"standard", | |
"lowercase", | |
{ | |
"type": "stemmer", | |
"name": "dutch_kp" | |
} | |
], | |
"text": "activeren, activeer, activeert, activeerden, activeerde, geactiveerd" | |
} |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment