This project aims to:
- Create an API of all diversity/inclusion in tech data that's currently out there, mainly looking at the diversity reports shared by most major tech giants starting in 2014
- Do some basic data visualization
- (Later iteration) Create endpoints of CS Education data
- (Later iteration) Do some pipeline analysis comparing the two data sets
"Diversity" and "inclusion" are both buzzwords in the tech industry, but too often the data isn't referenced in the conversation, or only a small slice of it is so as to dilute the conversation with bias. This project aims to remove bias and stick to the facts.
- Puttng all of the data in one place
- Doing some data visualization for the biggest buckets
- Potentially crafting an interactive narrative
People looking to engage in an honest and informed discussion around diversity in tech.
- Which APIs will you use?
- My own! (Is that ok?)
- Which OAuth integration are you planning to use?
- gmail
- Collect and import data (fortunately a lot of it already exists in csv form)
- Expose data as JSON endpoints
- Figure out how to get my front-end to talk to the back end, did some spiking on ember this weekend and really enjoyed it but not totally married to it.
- Do basic data analysis on data from endpoints
- Do some data visualization (Ember data? Don't think I totally get how it works)