Created
June 9, 2019 04:29
Download the data from https://www.kaggle.com/cityofLA/los-angeles-traffic-collision-data/downloads/traffic-collision-data-from-2010-to-present.csv/24
Write a program that reads the above data into memory in two formats:
1. Object-oriented: one object for every record
2. Columnar: a separate array for every attribute
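The two layouts above could be built in a single pass over the CSV, roughly as sketched below. This is only an illustration under assumptions: the three header names (`DR Number`, `Area Name`, `Zip Codes`) and the inline sample rows are hypothetical stand-ins, so check them against the header of the downloaded file, and `Collision` and `read_both` are names invented here.

```python
import csv
import io
from dataclasses import dataclass

# Hypothetical sample with assumed header names; the real file has more
# columns and its exact header should be checked after downloading.
SAMPLE_CSV = """DR Number,Area Name,Zip Codes
190001,Central,90012
190002,Hollywood,90028
190003,Central,90012
"""


@dataclass
class Collision:
    """Object-oriented layout: one instance per record."""
    dr_number: str
    area_name: str
    zip_code: str


def read_both(source):
    """Read CSV text from a file-like object into both layouts at once."""
    records = []  # list of Collision objects
    columns = {"dr_number": [], "area_name": [], "zip_code": []}  # one list per attribute
    for row in csv.DictReader(source):
        record = Collision(row["DR Number"], row["Area Name"], row["Zip Codes"])
        records.append(record)
        columns["dr_number"].append(record.dr_number)
        columns["area_name"].append(record.area_name)
        columns["zip_code"].append(record.zip_code)
    return records, columns


records, columns = read_both(io.StringIO(SAMPLE_CSV))
print(records[0])
print(columns["zip_code"])
```

For the real data set, `io.StringIO(SAMPLE_CSV)` would be replaced by a file opened with `open(path, newline="")`.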
Once the data set has been read into memory, the program should ask for a zip code.
After getting that input, the program should output the number of collisions that happened in that zip code twice, each time along with the time taken to process:
1. Using the object-oriented in-memory store
2. Using the columnar store
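The timed lookup described above might look like the following minimal sketch. It assumes the two stores are already in memory; the tiny hand-built stores, the field names, and the helpers `count_in_rows`, `count_in_columns`, and `timed` are all hypothetical. The zip code is hard-coded so the example runs unattended; an interactive version would read it with `input()`.

```python
import time
from collections import namedtuple

# Hypothetical miniature stores standing in for the loaded data set.
Collision = namedtuple("Collision", ["dr_number", "zip_code"])
raw = [("190001", "90012"), ("190002", "90028"),
       ("190003", "90012"), ("190004", "90036")]
row_store = [Collision(*r) for r in raw]
column_store = {
    "dr_number": [r[0] for r in raw],
    "zip_code": [r[1] for r in raw],
}


def count_in_rows(store, zip_code):
    """Scan every record object and compare its zip_code attribute."""
    return sum(1 for rec in store if rec.zip_code == zip_code)


def count_in_columns(store, zip_code):
    """Scan only the zip_code column; the other columns are never touched."""
    return store["zip_code"].count(zip_code)


def timed(fn, *args):
    """Return (result, elapsed seconds) for a single call of fn."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


zip_code = "90012"  # interactive version: zip_code = input("Zip code: ").strip()
for label, fn, store in (
    ("object-oriented", count_in_rows, row_store),
    ("columnar", count_in_columns, column_store),
):
    count, elapsed = timed(fn, store, zip_code)
    print(f"{label}: {count} collisions in {elapsed * 1e6:.1f} microseconds")
```

A single timed call on a small input is noisy, so any comparison between the two stores is only meaningful on the full data set and ideally averaged over several runs.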