Skip to content

Instantly share code, notes, and snippets.

@aaronfeng
Created August 17, 2012 14:23
Show Gist options
  • Save aaronfeng/3379083 to your computer and use it in GitHub Desktop.
Save aaronfeng/3379083 to your computer and use it in GitHub Desktop.
PhillyAWS - 8/28/12

I can't decided on the title. Maybe you can help me. 3 - 7 is courtesy of @codeslinger.

title

  1. "BigData: Data Crunch on a Budget"
  2. "BI in the Cloud"
  3. "You're using Oracle for that? Big Data and you"
  4. "Better Answers from More Data: Today's Business Intelligence Architecture"
  5. "How Did They Know That? Getting Smart Answers like Google"
  6. "I Just Want My SQL! High-Level Analysis on Giant Piles of Data"
  7. "How to Make the Other Guys Look Dumb"

abstract

Business intelligence is vital to many companies. It is common to run batch jobs to crunch some reports in order to gain insight into your current business. Traditional data warehouse and ETL techniques force you to "pick and choose" what you want to store then discard the rest that don't fit into the schema. What if you can easily preserve all your data points then figure out how to crunch them in the future? In this presentation I will go over the basics of Elastic MapReduce and Hive. EMR allows for crunching data on demand and at a low cost. Hive allows you analyze your data using a SQL-like interface.

location

Vox Medica, 601 Walnut Street, Suite 250-S, Philadelphia, PA

time

8/28/12 @ 6:30 pm

RSVP

http://www.doodle.com/tkvdekveved3sea2

@Randuin
Copy link

Randuin commented Aug 18, 2012

Bi in the cloud heh

@AlexCuse
Copy link

#4, though @gv0tch0's suggestion is pretty excellent.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment