@tiagotele
Created May 17, 2023 13:31
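#!/bin/bash
# Stop on the first failed command, unset variable, or failed pipeline stage,
# so a failed download is never followed by "Done!".
set -euo pipefail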
SPARK_VERSION='spark-3.1.2'
SPARK_URL="https://archive.apache.org/dist/spark/$SPARK_VERSION/$SPARK_VERSION-bin-without-hadoop.tgz"
echo "Downloading pre-built Spark (without bundled Hadoop)..."
wget "$SPARK_URL" -P /tmp
echo "Done!"
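# Unpack the Spark distribution; --strip-components=1 drops the top-level
# folder of the archive so bin/, jars/, etc. land directly in the target.
echo "Creating folder /home/ec2-user/SageMaker/spark..."
mkdir -p /home/ec2-user/SageMaker/spark
echo "Extracting Spark archive..."
tar -xvf "/tmp/$SPARK_VERSION-bin-without-hadoop.tgz" -C /home/ec2-user/SageMaker/spark --strip-components=1
echo "Done!"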
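
The script stops once the archive is unpacked. A possible follow-up step, sketched here under assumptions rather than taken from the original gist, is to check that the target folder really contains a Spark distribution and then export `SPARK_HOME`. The helper `check_spark_home` is a hypothetical name. Because the `without-hadoop` build ships no Hadoop jars, the sketch also sets `SPARK_DIST_CLASSPATH` from `hadoop classpath`, but only if a `hadoop` command happens to be on the `PATH`.

```shell
#!/bin/bash
# Hypothetical helper (not part of the original script): succeeds when the
# given directory contains an executable bin/spark-submit launcher.
check_spark_home() {
  local dir="$1"
  [ -x "$dir/bin/spark-submit" ]
}

# Same target folder the install script uses; can be overridden via the environment.
SPARK_HOME="${SPARK_HOME:-/home/ec2-user/SageMaker/spark}"

if check_spark_home "$SPARK_HOME"; then
  export SPARK_HOME
  export PATH="$SPARK_HOME/bin:$PATH"
  # The "without-hadoop" build needs Hadoop jars supplied separately; if a
  # hadoop CLI is available, pass its classpath to Spark.
  if command -v hadoop >/dev/null 2>&1; then
    export SPARK_DIST_CLASSPATH="$(hadoop classpath)"
  fi
  echo "SPARK_HOME set to $SPARK_HOME"
else
  echo "No Spark distribution found in $SPARK_HOME" >&2
fi
```

Source this snippet (for example with `source` from a notebook terminal) rather than executing it, so the exported variables stay in the current shell.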