Skip to content

Instantly share code, notes, and snippets.

@ashee
Last active June 7, 2017 22:56
Show Gist options
  • Save ashee/f4719eccbc181875eef967422dd6bb0c to your computer and use it in GitHub Desktop.
Save ashee/f4719eccbc181875eef967422dd6bb0c to your computer and use it in GitHub Desktop.
AWS EMR Custom Jar Spark Job
# see - http://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-commandrunner.html
# also see (not accurate but worth a read) - https://github.com/awslabs/emr-bootstrap-actions/blob/master/spark/examples/spark-submit-via-step.md
aws emr add-steps --cluster-id j-3GZ6YDD7POJFQ --steps \
Name=DSEMigration,Jar="command-runner.jar",Args=[spark-submit,--deploy-mode,cluster,--master,yarn,--class,org.ithaka.dsemig.DseMig.DseMig,s3://sequoia-install/dse-migration/dsemig_2.11-0.1.0-SNAPSHOT.jar,--jars,/mnt/var/lib/zeppelin/local-repo/com/datastax/spark/spark-cassandra-connector_2.11/2.0.0-M3/spark-cassandra-connector_2.11-2.0.0-M3.jar],ActionOnFailure=CONTINUE
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment