Spark : How to create a Single RDD from Multiple Files
A simple way to read multiple files into a single RDD
==================================
val fileRDD = sc.textFile(filename).repartition(1)
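Here filename is the path to the directory that holds the files, not to an individual file. textFile reads every file in that directory into one RDD of lines. The repartition(1) call then collapses the result into a single partition; it is optional and triggers a shuffle.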
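
The directory form is not the only way to point textFile at several files. It also accepts glob patterns and comma-separated path lists. The following is a minimal sketch rather than a definitive recipe: the object name MultiFileRdd, the run helper, the temp directory, the file names a.txt and b.txt, and the local[*] master are all assumptions made for illustration. It uses coalesce(1) in place of repartition(1), which also yields a single partition but avoids a full shuffle.

```scala
import java.nio.file.{Files, Path}
import org.apache.spark.sql.SparkSession

object MultiFileRdd {
  // Returns (number of lines read, number of partitions after coalesce).
  def run(): (Long, Int) = {
    // Local session for illustration only; master and app name are assumptions.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("multi-file-rdd")
      .getOrCreate()
    val sc = spark.sparkContext

    // Hypothetical input: a temp directory holding two small text files.
    val dir: Path = Files.createTempDirectory("rdd-input")
    Files.write(dir.resolve("a.txt"), "one\ntwo\n".getBytes("UTF-8"))
    Files.write(dir.resolve("b.txt"), "three\n".getBytes("UTF-8"))

    // 1. Directory path: every file in the directory goes into one RDD.
    val fromDir = sc.textFile(dir.toString)

    // 2. Glob pattern: only the files matching the pattern.
    val fromGlob = sc.textFile(s"$dir/*.txt")

    // 3. Comma-separated list: an explicit set of files.
    val fromList = sc.textFile(s"$dir/a.txt,$dir/b.txt")

    // All three forms see the same lines for this input.
    require(fromGlob.count() == fromDir.count())
    require(fromList.count() == fromDir.count())

    // Collapse to one partition without a full shuffle.
    val single = fromDir.coalesce(1)
    val result = (single.count(), single.getNumPartitions)

    spark.stop()
    result
  }

  def main(args: Array[String]): Unit = {
    val (lines, partitions) = run()
    println(s"lines=$lines partitions=$partitions")
  }
}
```

If the file each record came from matters, sc.wholeTextFiles(path) returns (file path, file content) pairs instead of individual lines.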