@al102964
Created April 13, 2020 01:12
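// Read the MovieLens genome-scores CSV from S3, using the header row for column names and inferring column types.
val genome_scores_df = spark.read.format("csv").option("header", "true").option("inferSchema", "true").load("s3://<s3-bucket>/genome-scores.csv")
// Write the DataFrame back to S3 as Parquet, replacing any existing output at that path.
genome_scores_df.write.mode("overwrite").parquet("s3://<s3-bucket>/movielens-parquet/genome-scores/")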