Skip to content

Instantly share code, notes, and snippets.

@n0531m
Created November 15, 2017 14:47
Show Gist options
  • Save n0531m/53dd2b5becbfeb43276f33bf861e066d to your computer and use it in GitHub Desktop.
Save n0531m/53dd2b5becbfeb43276f33bf861e066d to your computer and use it in GitHub Desktop.
running a beam pipeline from cli
#!/bin/bash
PROJECTID=moritani-bigdata
DATASET=opendatasg
RUNNER=DirectRunner
#RUNNER=DataflowRunner
mvn compile exec:java \
-Dexec.mainClass=com.gmail.n0531m.datagovsg.pipelines.TaxiAvailabilityPipeline \
-Dexec.cleanupDaemonThreads=false \
-Dexec.args=" \
--project=$PROJECTID \
--stagingLocation=gs://moritani-bigdata-dataflow/staging \
--gcpTempLocation=gs://moritani-bigdata-dataflow/temp \
--tempLocation=gs://moritani-bigdata-dataflow/temp \
--runner=$RUNNER \
--appName=SingaporeTaxiAvailbilityPipeline \
--sinkProjectId=$PROJECTID \
--sinkDatasetId=$DATASET \
--sinkTableIdSuffixPattern=yyyyMMdd \
--sinkTableIdPrefix=TaxiAvailability \
--sinkStoragePathPrefix=gs://moritani-bigdata-dataflow/output/TaxiAvailability_processed_ \
--startYyyyMMddHHmmss=20171001000000 \
--endYyyyMMddHHmmss=20171001005959 \
--datagovsgApikey=$DATAGOVSG_APIKEY \
"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment