@ottomata
Created September 7, 2017 18:14
refine popups
spark-submit \
--class org.wikimedia.analytics.refinery.job.JsonRefine \
./refinery-job/target/refinery-job-0.0.49-SNAPSHOT.jar \
--input-base-path /wmf/data/raw/eventlogging \
--database otto \
--output-base-path /user/otto/external/event02 \
--done-flag _REFINED \
--input-regex '.*eventlogging_(.+)/hourly/(\d+)/(\d+)/(\d+)/(\d+)' \
--input-capture 'table,year,month,day,hour' \
--table-whitelist '.*Popups' \
--since 2 \
--limit 2 \
--dry-run
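
To see what the `--input-regex` and `--input-capture` pair above would presumably extract from an input directory, here is a minimal bash sketch. It is not the JsonRefine implementation: the function name `extract_partition` and the example path are hypothetical, and `\d` is swapped for `[0-9]` because bash's `=~` uses POSIX extended regular expressions, which do not support `\d`.

```shell
#!/usr/bin/env bash
# Illustration only: pair each capture group of the regex with a name from
# the comma-separated capture list. The real job may behave differently.
input_regex='.*eventlogging_(.+)/hourly/([0-9]+)/([0-9]+)/([0-9]+)/([0-9]+)'
input_capture='table,year,month,day,hour'

# Hypothetical helper: print name=value pairs for a path,
# or return 1 if the path does not match the regex.
extract_partition() {
  local path=$1
  local -a names
  IFS=',' read -r -a names <<< "$input_capture"
  [[ $path =~ $input_regex ]] || return 1
  local i
  for i in "${!names[@]}"; do
    # BASH_REMATCH[0] is the whole match; capture groups start at index 1.
    printf '%s=%s\n' "${names[$i]}" "${BASH_REMATCH[$((i + 1))]}"
  done
}

# Assumed example path; the actual directory layout may differ.
extract_partition '/wmf/data/raw/eventlogging/eventlogging_Popups/hourly/2017/09/07/18'
```

For the assumed path this prints `table=Popups`, `year=2017`, `month=09`, `day=07` and `hour=18`, one per line. A path that does not match the regex produces no output and a non-zero return status.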