@jkuruzovich
Last active August 11, 2017 21:03
Introduction to Big Data with Spark

Class 12: Technology Fundamentals of Business Analytics

Introduction to Big Data

Class Objective:

The goal of this class is to investigate basic concepts surrounding big data processing with MapReduce and Spark.

Readings (To be done before class):

Databricks Talk
Create a DataBricks Community Edition Account
Gentle Introduction To Spark

In Class Activities:

Word2Vec
Presentation
Databricks Demo
Introduction to MapReduce (links: local, GitHub, slides)
Introduction to Spark (links: local, GitHub, slides)
Lab2 Word Count

Notes: Be sure to install the library test_helper.
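
The word count exercise follows the map, shuffle, reduce pattern covered in the MapReduce introduction. A minimal sketch of that pattern in plain Python is given here for orientation. It is not the Lab2 solution and does not use Spark or test_helper; the function names and the sample lines are hypothetical, chosen only for illustration.

```python
import re
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of text."""
    return [(word, 1) for word in re.findall(r"[a-z0-9']+", line.lower())]


def shuffle_phase(pairs):
    """Group the emitted counts by word, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for word, count in pairs:
        grouped[word].append(count)
    return grouped


def reduce_phase(grouped):
    """Sum the grouped counts to get one total per word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines):
    """Run the three phases in sequence over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return reduce_phase(shuffle_phase(pairs))


# Hypothetical sample input, standing in for a text file split into lines.
sample_lines = ["the quick brown fox", "the lazy dog", "The fox"]
counts = word_count(sample_lines)
```

In Spark the same idea is usually written as a chain of transformations over a distributed dataset, with the grouping step handled by the framework rather than by an explicit dictionary.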

Assignment (due the second Wednesday following class by 11:59 PM):

(1) https://www.edx.org/course/big-data-analysis-apache-spark-uc-berkeleyx-cs110x
Unfortunately, this course is closed, but the video lectures are still available on this YouTube Playlist.
(2) Lab2 Word Count

Notes:


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.