Skip to content

Instantly share code, notes, and snippets.

@dehora
Last active August 29, 2015 14:18
Show Gist options
  • Save dehora/e14295032d9c94edf204 to your computer and use it in GitHub Desktop.
Save dehora/e14295032d9c94edf204 to your computer and use it in GitHub Desktop.
git clone https://github.com/apache/samza-hello-samza.git
cd samza-hello-samza
git checkout master
rm -rf deploy/*
bin/grid bootstrap
mvn clean package
rm -rf deploy/samza
tar -xvf target/hello-samza-0.8.0-dist.tar.gz -C deploy/samza
deploy/samza/bin/run-job.sh --config-factory=org.apache.samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-feed.properties
deploy/samza/bin/run-job.sh --config-factory=org.apache.samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-parser.properties
deploy/samza/bin/run-job.sh --config-factory=org.apache.samza.config.factories.PropertiesConfigFactory --config-path=file://$PWD/deploy/samza/config/wikipedia-stats.properties
# in new terminals
deploy/kafka/bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic wikipedia-raw
deploy/kafka/bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic wikipedia-edits
deploy/kafka/bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic wikipedia-stats
# stop the consumer processes first or they'll spin and need to be hard killed
bin/grid stop all
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment