dbt_adapter.md

jwills commented May 16, 2022

Just read up in the thread a bit-- doing the sinks as analyses is smart, esp. given that constraint re: the lag with which they should be created relative to the rest of the pipeline. /cc @morsapaes

ahelium commented May 18, 2022 •

edited

Loading

PREVIEW would be super neat! One way to accomplish previewing a non materialized source is to use a TAIL command within a transaction to limit the number of rows returned. And now that TAIL can have internal queries, you can run any dev materialized view statement to peep your data:

materialize=> BEGIN;
BEGIN
materialize=> DECLARE c CURSOR for TAIL (SELECT convert_from(data, 'utf8') AS data FROM rp_flight_information);
DECLARE CURSOR
materialize=> FETCH 2 c;
mz_timestamp  | mz_diff |                                                                                                                                                                            data
---------------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
 1652882799999 |       1 | {"icao24": "345682", "callsign": "", "origin_country": "Spain", "time_position": null, "last_contact": 1652829778, "longitude": null, "latitude": null, "baro_altitude": null, "on_ground": false, "velocity": 260.21, "true_track": 100.71, "vertical_rate": 0, "sensors": null, "geo_altitude": null, "squawk": null, "spi": false, "position_source": 0}
 1652882799999 |       1 | {"icao24": "38a1db", "callsign": "", "origin_country": "France", "time_position": null, "last_contact": 1652821139, "longitude": null, "latitude": null, "baro_altitude": null, "on_ground": true, "velocity": 0, "true_track": 123.75, "vertical_rate": null, "sensors": null, "geo_altitude": null, "squawk": "7776", "spi": false, "position_source": 0}
(2 rows)

I wonder if we could codify that process somehow for users, to make things simpler.

Author

morsapaes commented May 18, 2022

Thanks for writing that down, @jwills! Your comment + having a chat with @dataders (+ some 🚿 time) made me look at things from more of a workflow separation perspective. The TL;DR for the refactor (as we follow the progress of things like external nodes and the short-term plans to revamp the programmatic interfaces of dbt itself) sounds like:

Both sources and sinks should be completely decoupled from SQL models, which also means the creation of these objects should not happen at any point during dbt run, but as a separate staging step (much like what happens in dbt-external-tables). These should be YAML-ified (since they're pure DDL statements) and created using something like dbt stage --sources/dbt stage --sinks.

Does this sound reasonable? This separation should also make it easier to integrate with CI/CD pipelines that can trigger the creation of the right objects at the right time.

+1 that something like PREVIEW would be useful, but I'm having trouble understanding how that could help in context of the adapter. It'd be cool to wrap what @ahelium pointed out in a dry run-like command to preview the results of a (transformation) model, though!

morsapaes/dbt_adapter.md

Select an option

No results found

Select an option

No results found

Evolving the `dbt-materialize` adapter

Sources

Option 1: `dbt-external-tables`

User workflow

Option 2: `pre-hook` on models

User workflow

Sinks

Option 1: `post-hook` on models

User workflow

Option 2: custom metadata on `exposures`

User workflow

Handling credentials

jwills commented May 16, 2022

Uh oh!

ahelium commented May 18, 2022 •

edited

Loading

Uh oh!

morsapaes commented May 18, 2022

Uh oh!

morsapaes/dbt_adapter.md

Evolving the dbt-materialize adapter

Sources

Option 1: dbt-external-tables

User workflow

Option 2: pre-hook on models

User workflow

Sinks

Option 1: post-hook on models

User workflow

Option 2: custom metadata on exposures

User workflow

Handling credentials

jwills commented May 16, 2022

Uh oh!

ahelium commented May 18, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

morsapaes commented May 18, 2022

Uh oh!

Evolving the `dbt-materialize` adapter

Option 1: `dbt-external-tables`

Option 2: `pre-hook` on models

Option 1: `post-hook` on models

Option 2: custom metadata on `exposures`

ahelium commented May 18, 2022 •

edited

Loading