VeylanSolmira

from pyspark import SparkContext


def main():
    sc = SparkContext(appName="Test Compression")
    # The RDD has to contain (key, value) pairs
    data = sc.parallelize([
        ("key1", "value1"),
        ("key2", "value2"),
        ("key3", "value3"),