- https://gist.github.com/HonzaKral/35451c6322a78d4de8f5 to load Stack Overflow data. Clone it into the load_stack subdirectory within this repo.
- Programmers StackExchange dump from http://www.clearbits.net/creators/146-stack-exchange-data-dump - the relevant files (Posts.xml, Users.xml and Comments.xml) should be placed in the data subdirectory in this repo.
- elasticat (https://gist.github.com/karmi/5670573) for pretty-printing Elasticsearch results
- Kibana 3 with the dashboard loaded from 07-kibana.json
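The file layout described in the list above can be checked before the presentation. This is a minimal sketch, assuming the paths named above (load_stack/load_stack.sh and the three XML files under data); the function name check_prereqs is hypothetical and not part of this repo.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): verify the layout described above.
# Usage: check_prereqs [repo_root]   (defaults to the current directory)
check_prereqs() {
    root="${1:-.}"
    missing=0
    for f in load_stack/load_stack.sh data/Posts.xml data/Users.xml data/Comments.xml; do
        if [ ! -f "$root/$f" ]; then
            # Report each missing file on stderr and remember the failure.
            echo "missing: $f" >&2
            missing=1
        fi
    done
    # Exit status 0 when everything is present, 1 otherwise.
    return "$missing"
}
```

Calling check_prereqs "$PWD" from the root of this repo prints any missing file and returns a non-zero status if something is absent.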
From the root of this repo, run:
BULK_FILE=$PWD/bulk.json ES_INDEX=programmers.stackexchange.com load_stack/load_stack.sh $PWD/data
This should generate a bulk.json that can be reused during the presentation (a second run of this script won't have to regenerate it from the XML).
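Since bulk.json is reused later, it may be worth a quick sanity check after it is generated. The sketch below assumes the file follows the usual Elasticsearch bulk layout of one action line followed by one source line per document, and that load_stack.sh emits only such pairs; that is an assumption about the script, not something confirmed here. The function name bulk_doc_count is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repo): estimate the number of documents
# in a bulk file, ASSUMING every document is an action line plus a source line.
# Usage: bulk_doc_count path/to/bulk.json
bulk_doc_count() {
    file="$1"
    if [ ! -s "$file" ]; then
        echo "empty or missing: $file" >&2
        return 1
    fi
    # Count newline-terminated lines; strip the padding some wc builds add.
    lines=$(wc -l < "$file" | tr -d ' ')
    if [ $((lines % 2)) -ne 0 ]; then
        # An odd count breaks the action/source pairing assumed above.
        echo "odd number of lines ($lines): file may be truncated" >&2
        return 1
    fi
    echo $((lines / 2))
}
```

Running bulk_doc_count "$PWD/bulk.json" prints the estimated document count, or fails if the file is missing, empty, or has an odd number of lines.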
Clear all your indices (this deletes every index on the local Elasticsearch instance):
curl -X DELETE http://localhost:9200
At this point it's a good idea to get Kibana up and running, load the dashboard, and verify that it is sane and displays the correct data.
You should be set to go, enjoy!