Ernesto Ongaro ernestoongaro

Solutions Architect @ dbt Labs. Transforming data in your warehouse!

ernestoongaro / modin_ forecast_daily_returns.py

Created December 16, 2024 12:26

Move from Pandas to Modin with Snowflake's Snowpark & dbt

	import modin.pandas as pd
	import snowflake.snowpark.modin.plugin
	from prophet import Prophet

	def model( dbt, session ):

	dbt.config(
	materialized="table", # the incremental materialization is also supported
	packages=['pandas==2.2.1','modin==0.28.1','Prophet','holidays==0.18','snowflake-snowpark-python[modin]'], # how to import python libraries in dbt's context
	python_version="3.11"

ernestoongaro / main.py

Created May 19, 2023 16:53

Fivetran Webhooks -> dbt Cloud Run https://www.loom.com/share/4124fb01427c466797c5329783d7c848

	import enum
	import os
	import time

	# Be sure to `pip install requests` in your python environment
	import requests

	ACCOUNT_ID = 39
	JOB_ID = 302

ernestoongaro / incremental_lastrun_auto_materialization

Created April 3, 2023 16:05

	{% materialization incremental_lastrun_auto, adapter='snowflake' -%}

	{% set original_query_tag = set_query_tag() %}

	{%- set unique_key = config.get('unique_key') -%}
	{%- set full_refresh_mode = (should_full_refresh()) -%}

	{% set target_relation = this %}
	{% set existing_relation = load_relation(this) %}
	{% set tmp_relation = make_temp_relation(this) %}

ernestoongaro / pareto.py

Created February 14, 2023 13:58

	#example from https://towardsdatascience.com/reduce-warehouse-space-with-the-pareto-principle-using-python-e722a6babe0e
	#optimizing warehouse location based on sku frequency using pareto 80/20 principle

	import pandas as pd


	def model(dbt, session):
	dbt.config(
	materialized="table",
	packages=["pandas"] #https://repo.anaconda.com/pkgs/snowflake/

ernestoongaro / adding-service-account-key-for-bigquery.ipynb

Last active February 17, 2023 17:21

adding-service-account-key-for-bigquery.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

ernestoongaro / add_search_optimization.sql

Last active December 3, 2022 08:42

Search Optimizations for Snowflake Macro (for use in post_hook)

	{% macro add_search_optimization(relation,columns) %}

	{# check if the relation has search optimization added #}
	{%- call statement('search_optimization', fetch_result=True) -%}
	describe search optimization on {{ relation }}
	{%- endcall %}

	{% if execute %}
	{% set result = load_result('search_optimization') %}
	{%- set result_data = result['data'] -%}

ernestoongaro / dynamic_pivot_multiple_variables.sql

Created September 27, 2021 11:24



	===
	Compiled SQL:
	select
	EXTRACT(YEAR from ship_date),
	sum(case when ship_mode = 'FOB' then gross_item_sales_amount end) as FOB_A_amount,
	sum(case when ship_mode = 'TRUCK' then gross_item_sales_amount end) as TRUCK_N_amount,
	sum(case when ship_mode = 'MAIL' then gross_item_sales_amount end) as MAIL_N_amount,
	sum(case when ship_mode = 'SHIP' then gross_item_sales_amount end) as SHIP_R_amount,

ernestoongaro / dynamic_pivot.sql

Created September 20, 2021 11:46

	/* Create a pivot table with dynamic columns based on the ship modes that are in the system */

	{%- call statement('result', fetch_result=True) -%}

	{# this pulls the unique ship modes from the fct_order_items table #}
	select ship_mode from {{ ref('fct_order_items') }} group by 1

	{%- endcall %}

	{% set ship_modes = load_result('result').table.columns[0].values() %}

ernestoongaro / json.md

Last active September 20, 2021 11:44

json example

First run this in snowflake:

(
 src variant
)
AS SELECT parse_json(column1) as src
FROM values
('{ 
 "topleveldate" : "2017-04-28",

ernestoongaro / core.yml

Created September 15, 2021 15:12

core.yml

	models:
	- name: dim_customers
	description: Customer dimensions table
	columns:
	- name: customer_key
	description: Primary key on the customers table
	tests:
	- unique
	- not_null
	- name: region