Forked from mkanchwala/Spark-SingleRDDfrmMultipleFiles
Created January 20, 2016 17:10
Spark: How to create a Single RDD from Multiple Files
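Best way to load multiple files into a single RDD
==================================================
val fileRDD = sc.textFile(filename).repartition(1)
Here, filename is the path to the directory that contains the files, not to an individual file. sc.textFile reads every file in that directory into one RDD, and repartition(1) then collapses the result into a single partition.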
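The one-liner above assumes an existing SparkContext named sc, as in spark-shell. As a minimal standalone sketch of the same idea, the program below creates its own local context, writes two small sample files to a temporary directory, and reads that directory into one RDD. The object name, app name, local[*] master, and the sample files are all hypothetical and only there to make the example self-contained.

```scala
import java.nio.file.{Files, Path}
import org.apache.spark.{SparkConf, SparkContext}

object SingleRddFromDirectory {
  def main(args: Array[String]): Unit = {
    // App name and local master are placeholders for illustration only.
    val conf = new SparkConf().setAppName("single-rdd-demo").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // Hypothetical sample data: two small text files in a temporary directory.
      val dir: Path = Files.createTempDirectory("rdd-demo")
      Files.write(dir.resolve("a.txt"), "one\ntwo\n".getBytes("UTF-8"))
      Files.write(dir.resolve("b.txt"), "three\nfour\n".getBytes("UTF-8"))

      // Passing the directory (not a single file) reads every file in it
      // into one RDD, with one element per line of text.
      val fileRDD = sc.textFile(dir.toUri.toString)

      // repartition(1) shuffles all lines into a single partition.
      val single = fileRDD.repartition(1)

      println(s"lines: ${single.count()}, partitions: ${single.getNumPartitions}")
    } finally {
      sc.stop()
    }
  }
}
```

sc.textFile also accepts wildcards and comma-separated paths, so a pattern such as a directory followed by /*.txt should work when only some of the files are wanted. Note that repartition(1) is only needed when a single partition is required, for example to write one output file. The RDD returned by textFile is already a single RDD without it.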