For those like me who wish to continue learning about ML using scientific Python stack, check this video workshop by Jake VanderPlas
Here is the code https://github.com/jakevdp/sklearn_pycon2015/
So what steps I did to setup correctly working PySpark with Anaconda with 200 libraries on courses Vagrant VM
- Install Anaconda or Miniconda, you should be familiar with linux shell. Vagrant Spark VM is Ubuntu 32bit and Python 2.7 until PySpark for py3 not yet released. Get download url from http://continuum.io/downloads#all