Skip to content

Instantly share code, notes, and snippets.

@qingniufly
Created March 1, 2017 10:10
Show Gist options
  • Save qingniufly/e05a8a7e43a18a012a6e0a22f0bfa880 to your computer and use it in GitHub Desktop.
Save qingniufly/e05a8a7e43a18a012a6e0a22f0bfa880 to your computer and use it in GitHub Desktop.
spark read and write parquet files
// read
val filePath = "file:///path/to/spark/examples/src/main/resources/users.parquet"
val users = sqlContext.read.load(filePath)
// write
val personJsonFile = "file:///path/to/spark/examples/src/main/resources/people.json"
val personDF = sqlContext.read.json(personJsonFile)
personDF.select("name", "age").write.format("parquet").save("file:///tmp/new_people.parquet")
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment