Stuart Lynn sllynn

@sllynn
sllynn / snowflake-sparklyr.R
Created September 17, 2019 14:05
connect to snowflake using sparklyr
library(sparklyr)
SparkR::sparkR.session()
sc <- spark_connect(method="databricks")
snow.df.sparklyr <- spark_read_source(
  sc = sc,
  name = "adult",
  source = "snowflake",
  options = list(
@sllynn
sllynn / pandas-dateadd
Created September 17, 2019 13:58
add days to dates in pandas dataframe / series
import pandas as pd


def create_date_window(in_date, window_size):
    date_lower = pd.to_datetime(in_date) - pd.DateOffset(days=window_size)
    date_upper = pd.to_datetime(in_date) + pd.DateOffset(days=window_size)
    return date_lower, date_upper


if __name__ == "__main__":
    print(create_date_window("2019-09-17", 28))
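The gist description mentions dataframes and series, while the snippet above is only called with a single date string. A minimal sketch of the same windowing applied to a whole Series might look like the following. The function name `add_date_window` and the output column names are hypothetical, not part of the gist, and `pd.Timedelta` is swapped in for `pd.DateOffset` as a fixed-length day offset.

```python
import pandas as pd


def add_date_window(dates: pd.Series, window_size: int) -> pd.DataFrame:
    """Return the dates window_size days before and after each input date.

    Hypothetical helper: the name and column labels are illustrative only.
    """
    parsed = pd.to_datetime(dates)
    # A Timedelta is a fixed duration, so it is applied element-wise to the Series.
    offset = pd.Timedelta(days=window_size)
    return pd.DataFrame({
        "date_lower": parsed - offset,
        "date_upper": parsed + offset,
    })


dates = pd.Series(["2019-09-17", "2020-02-29"])
windows = add_date_window(dates, 28)
print(windows)
```

The result keeps the index of the input Series, so it can be joined back onto the originating dataframe if needed.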
@sllynn
sllynn / pyspark-melt.py
Created September 17, 2019 13:52
melt a pyspark dataframe
from pyspark.sql.functions import array, col, explode, lit, struct
from pyspark.sql import DataFrame
from typing import Iterable
def melt(
        df: DataFrame,
        id_vars: Iterable[str], value_vars: Iterable[str],
        var_name: str = "variable", value_name: str = "value") -> DataFrame:
    """Convert :class:`DataFrame` from wide to long format."""