Skip to content

Existing Airflow DAG and dbt model cleanup for every data source #264

@jrlegrand

Description

@jrlegrand

Problem Statement

We need a wholesale review of our current Airflow DAGs and dbt setup for each data source from beginning to end.

Criteria for Success

Every checkbox in the DAGs and Work section of our Roadmap is checked and complete.

Airflow Standardized

  • Abstraction
    • Use airflow operator to create dag and tasks
    • Common tasks used to centralize logic
  • Loaded
    • Sources loaded to data lake

Dbt Buildout

  • Transformations in staging models
  • Combinations in intermediate models
  • Final table in prod schema mart model

Data Documented

  • Source data documented
  • Data source reference linked
  • Any additional tables documented

Additional Information

Roadmap: https://docs.google.com/document/d/1LTWPbgTdGrlRq2qi5KAyj68rtyGJfYRB_luUIN_1WFo/edit#heading=h.p5fi4rrf2kkp

Go down to the DAGs and Work section at the bottom.

image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    Status

    Todo

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions