Skip to content

Instantly share code, notes, and snippets.

@p1nox
Last active January 12, 2018 20:45
Show Gist options
  • Save p1nox/d0ca7c17ae5fe5d647df23f6c9d12223 to your computer and use it in GitHub Desktop.
Save p1nox/d0ca7c17ae5fe5d647df23f6c9d12223 to your computer and use it in GitHub Desktop.
Spark Notes

Using docker:

docker run -it --rm -p 8888:8888 jupyter/all-spark-notebook

From standalone installation:

PYSPARK_DRIVER_PYTHON_OPTS="notebook" bin/pyspark

Run Spark Standalone Cluster in a single machine.

  • Install Java 7
sudo add-apt-repository ppa:webupd8team/java
sudo apt-get update
sudo apt-get install oracle-java7-installer

sudo apt-get install scala
  • Install Spark
wget http://d3kbcqa49mib13.cloudfront.net/spark-2.1.0-bin-hadoop2.7.tgz
tar zxvf spark-2.1.0-bin-hadoop2.7.tgz
mv spark-2.1.0-bin-hadoop2.7 spark_2.1.0

Download section.

  • Add to .profile
export SPARK_HOME=/home/raul/spark_2.1.0
export PATH=$PATH:$SPARK_HOME/bin
  • Activate profile
source .profile
  • Add configuration file

Create file vim $SPARK_HOME/conf/spark-env.sh containing:

SPARK_WORKER_CORES=2
SPARK_WORKER_INSTANCES=2
SPARK_WORKER_MEMORY=2g
  • Activate master and slaves
$SPARK_HOME/sbin/start-master.sh
$SPARK_HOME/sbin/start-slave.sh spark://data-app1-uat.east:7077
$SPARK_HOME/logs/spark-root-org.apache.spark.deploy.master.Master-1-data-app1-uat.east.out
$SPARK_HOME/logs/spark-root-org.apache.spark.deploy.worker.Worker-1-data-app1-uat.east.out
$SPARK_HOME/logs/spark-root-org.apache.spark.deploy.worker.Worker-2-data-app1-uat.east.out
$SPARK_HOME/sbin/stop-master.sh
$SPARK_HOME/sbin/stop-slave.sh

Resources:

https://spark.apache.org/docs/latest/cluster-overview.html

https://spark.apache.org/docs/latest/spark-standalone.html

https://github.com/jaceklaskowski/mastering-apache-spark-book/blob/master/spark-standalone-example-2-workers-on-1-node-cluster.adoc

docker run -p 8080:8080 -p 4040:4040 sofianito/zeppelin
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment