Last active
June 1, 2020 07:17
-
-
Save hdemers/5337229 to your computer and use it in GitHub Desktop.
Installation instructions for doing data science in a Python environment on Ubuntu. We'll install base packages like numpy, scipy, scikit-learn and pandas. We also install the IPython Notebook interactive environment. This is a best practice recommendation for doing research-type work. We make use of virtualenvwrapper, but don't show how to inst…
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
mkvirtualenv datascience | |
sudo apt-get install python-scipy libblas-dev liblapack-dev gfortran | |
sudo apt-get install libffi-dev # for cryptography from scrapy | |
sudo apt-get install libxslt-dev # for libxml from scrapy | |
export BLAS=/usr/lib/libblas.so | |
export LAPACK=/usr/lib/liblapack.so | |
pip install numpy | |
pip install scipy | |
pip install scikit-learn | |
pip install pandas | |
pip install patsy | |
pip install statsmodels | |
pip install ipython tornado pyzmq | |
pip install networkx | |
pip install gensim | |
pip install scrapy | |
pip install numexpr bottleneck | |
pip install sqlalchemy | |
pip install nltk | |
pip install seaborn |
Also, it seems like numpy and scipy install automatically with the sudo apt-get install python-scipy
@startakovsky, both command install scipy
but they're somewhat different. Using sudo apt-get python-scipy
gives you the most stable version that the OS supports but pip install scipy
installs the latest version that scipy pushed in to the pypi index.
Thank you
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Do the export statements stick after reboot? Do they need to be added to a
.profile
, what are they for?