TalkPython.fm #90 - Data Wrangling with Python
- Katharine on the web: http://kjamistan.com
- Katharine on twitter: @kjam
- Book: Data Wrangling with Python: Tips and Tools to Make Your Life Easier: http://amzn.to/2fGc0Cx
- Pycon 2016: How to Automate your Data Cleanup with Python: http://youtube.com/watch?v=gp-ngPV_ZX8
- Dedupe Python Library: http://github.com/datamade/dedupe
- probablepeople: http://github.com/datamade/probablepeople
- usaddress: http://github.com/datamade/usaddress
- jellyfish: http://github.com/jamesturk/jellyfish
- Fuzzywuzzy: http://github.com/seatgeek/fuzzywuzzy
- scrubadub: http://github.com/datascopeanalytics/scrubadub
- pint: http://pint.readthedocs.io
- arrow: http://github.com/crsmithdev/arrow
- pdftables.six: http://github.com/vnaydionov/pdftables
- Datacleaner: http://github.com/rhiever/datacleaner
- Parserator: http://github.com/datamade/parserator
- Gensim: http://radimrehurek.com/gensim
- Faker: http://github.com/joke2k/faker
- Dask: http://dask.pydata.org
- SpaCy: http://spacy.io
- Airflow: http://airflow.incubator.apache.org
- Luigi: http://luigi.readthedocs.io
- Hypothesis (testing): http://hypothesis.works