Pre-requisites:
- Java Development Kit (JDK): download at http://www.oracle.com/technetwork/java/javase/downloads/index.html
- Spark pre-built for Hadoop 2.6 (in case not already installed): download at http://spark.apache.org/downloads.html
- Jupyter notebook:
sudo pip install jupyter/sudo pip3 istall jupyter
In this example, I have extracted the tar.gz file in /opt. Remember to use the correct path for $SPARK_HOME environment variable (see below), depending on how you installed it.