Skip to content

Instantly share code, notes, and snippets.

@corneliouzbett
Created March 15, 2019 14:04
Show Gist options
  • Save corneliouzbett/a940e28b9f29fc447a4b66079f0d57b2 to your computer and use it in GitHub Desktop.
Save corneliouzbett/a940e28b9f29fc447a4b66079f0d57b2 to your computer and use it in GitHub Desktop.
Example of take
from pyspark import SparkContext, SparkConf
if __name__ == "__main__":
conf = SparkConf().setAppName("take").setMaster("local[*]")
sc = SparkContext(conf = conf)
inputWords = ["spark", "hadoop", "spark", "hive", "pig", "cassandra", "hadoop"]
wordRdd = sc.parallelize(inputWords)
words = wordRdd.take(3)
for word in words:
print(word)
@corneliouzbett
Copy link
Author

This is an example code for take function.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment