Skip to content

Instantly share code, notes, and snippets.

@amodm
Created June 9, 2019 04:29
Show Gist options
  • Save amodm/f49b5e9a22d654c879189fc889e2d21c to your computer and use it in GitHub Desktop.
Save amodm/f49b5e9a22d654c879189fc889e2d21c to your computer and use it in GitHub Desktop.
Download the data from https://www.kaggle.com/cityofLA/los-angeles-traffic-collision-data/downloads/traffic-collision-data-from-2010-to-present.csv/24
Write a program to read the above data into memory in two formats:
1. Object oriented - for every record
2. Columnar format - a separate array for every attribute
Once the data set has been read into memory, the program should ask for a zip code.
After getting that input, the program should output the number of collisions that happened in that zipcode two times (along with time taken to process):
1. Using the object oriented in-memory store
2. Using the columnar format store
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment