Get the slides from my talk in the old-fashioned Portable Document Format here.
- The National Library of Norway
- The NL search API
- My projects: PDF Search, NB Graph and NBngram
- Access the internet through the shell with wget
- Script Chrome with iMacros for tedious and repetitive tasks
- OpenRefine (previously Google Refine) lets you take control of and normalize huge sets of data.
- Regex101 is an awesome tool for learning and figuring out regular expressions.
- David McClure has some interesting stuff on his blog.
- Jupyter notebook
- The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. Uses include: data cleaning and transformation, numerical simulation, statistical modeling, machine learning and much more.
- This plugin allows you to use Jupyter inline in the Atom text editor. It's pretty, pretty, pretty good.
- word2vec
- Efficient Estimation of Word Representations in Vector Space (2013)
- Jupyter notebook example
- A pretty thorough DH intro to word2vec
- There are now multiple ways of running word2vec
- As a part of the TensorFlow package
- As a part of gensim
- With a Python interface
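
Whichever of these you pick, the skip-gram flavour of word2vec described in the 2013 paper learns from (center word, context word) pairs taken from a sliding window over the text. Here is a rough sketch of that pairing step in plain Python. The helper name `skipgram_pairs`, the toy sentence and the fixed `window` size are my own assumptions for illustration, not code from any of the packages above, and the real implementations do the actual training on top of pairs like these.

```python
def skipgram_pairs(tokens, window=2):
    """Return (center, context) pairs for every neighbour within `window` positions.

    Hypothetical helper for illustration only; not part of TensorFlow or gensim.
    """
    pairs = []
    for i, center in enumerate(tokens):
        # Clamp the window to the start and end of the token list.
        start = max(0, i - window)
        stop = min(len(tokens), i + window + 1)
        for j in range(start, stop):
            if j != i:  # a word is not its own context
                pairs.append((center, tokens[j]))
    return pairs


# Toy input: one lowercased, whitespace-tokenized sentence.
sentence = "the national library of norway".split()
pairs = skipgram_pairs(sentence, window=1)

for center, context in pairs:
    print(center, "->", context)
```

With `window=1` each word is paired only with its immediate neighbours; a larger window gives every word more context words and produces more pairs.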