Skip to content

Instantly share code, notes, and snippets.

@mannharleen
Created August 30, 2017 17:05
Show Gist options
  • Save mannharleen/ccafa4d1f3f4904c1bf234d6f2839a51 to your computer and use it in GitHub Desktop.
Save mannharleen/ccafa4d1f3f4904c1bf234d6f2839a51 to your computer and use it in GitHub Desktop.
Use sparkSQL in hive context to create a managed partitioned table. Use temp table to insert data into managed table using substring hive function
//import necessary
import org.apache.spark.{SparkContext,SparkConf}
import org.apache.spark.sql.hive.HiveContext
//initializations
val conf = new SparkConf().setAppName("xx").setMaster("local[2]")
val sc = new SparkContext(conf)
val hiveContext = new HiveContext(sc)
//create and load temp table
hiveContext.sql("use nyse_db")
hiveContext.sql("create temporary table nyse_temp (name String, date String, open Double, high Double, low Double, close Double, volume BigInt) ROW FORMAT delimited fields terminated by ',' ")
hiveContext.sql("load data local inpath 'D:\\OneDrive - Parkway Pantai\\BIG DATA\\data\\nyse\\nyse_data' into table nyse_temp")
//create managed table with partition
hiveContext.sql("create table nyse (name String, date String, open Double, high Double, low Double, close Double, volume BigInt) partitioned by (month String)")
// -- note: since date and month have diff column names its fine. if we were to use date as a whole as partitioned by column, make sure to rename original date column to something else.
// -- this is necessary since partitioned by create a pseudo column with that name
//use insert statement to perform dynamic partitioning from temp table
hiveContext.sql("set hive.exec.dynamic.partition.mode=nonstrict")
hiveContext.sql("insert into table nyse partition(month) select name,date,open,high,low,close,volume,substr(date,8,4) as month from nyse_temp")
// -- most important to note: 'month' is used for dynamic partition and must be the last column in the select statement
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment