A Rust crate to quickly build replication solutions for Postgres. Build data pipelines which continually copy data from Postgres to other systems.
This crate builds abstractions on top of Postgres's logical streaming replication protocol and steers users toward the pit of success, without making them worry about the low-level details of the protocol.
- Features
- Installation
- Quickstart
- Database Setup
- Running Tests
- Docker
- Architecture
- Troubleshooting
- License
## Features

The `etl` crate supports the following destinations:
- BigQuery
- Apache Iceberg (planned)
- DuckDB (planned)
## Installation

To use `etl` in your Rust project, add the core library and desired destinations via git dependencies in `Cargo.toml`:
```toml
[dependencies]
etl = { git = "https://github.com/supabase/etl" }
etl-destinations = { git = "https://github.com/supabase/etl", features = ["bigquery"] }
```
The `etl` crate provides the core replication functionality, while `etl-destinations` contains the destination-specific implementations. Each destination is gated behind a feature of the same name in the `etl-destinations` crate. The git dependencies are needed for now because the crates are not yet published on crates.io.
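Because the dependencies are pulled straight from git, builds can shift as the repository moves. If you want reproducible builds, Cargo's standard `rev` (or `tag`) field lets you pin both crates to the same commit; the revision below is a placeholder, not a real commit:

```toml
[dependencies]
# Pin both crates to the same commit so they stay in sync
# (the rev value is a placeholder; substitute an actual commit hash).
etl = { git = "https://github.com/supabase/etl", rev = "abc1234" }
etl-destinations = { git = "https://github.com/supabase/etl", rev = "abc1234", features = ["bigquery"] }
```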
## Quickstart

To quickly get started with `etl`, see the `etl-examples` crate, which contains practical examples and detailed setup instructions.
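The examples live in the repository itself, so the quickest path is to clone it and build the examples crate locally. The commands below assume only a standard Rust toolchain and the repository layout referenced in this README:

```bash
# Clone the repository and build the examples crate
git clone https://github.com/supabase/etl.git
cd etl
cargo build -p etl-examples
# Browse etl-examples/ for the individual examples and their setup notes
```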
## Database Setup

Before running the examples, tests, or the API and replicator components, you'll need to set up a PostgreSQL database. We provide a convenient script to help you with this setup. For detailed instructions on how to use the database setup script, please refer to our Database Setup Guide.
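The script automates the details, but at its core logical replication only needs two things on the Postgres side: `wal_level` set to `logical` and a publication covering the tables you want to replicate. As a rough sketch of what that involves (the publication and table names here are illustrative, not necessarily what the script creates):

```sql
-- Logical replication requires wal_level = logical
-- (set in postgresql.conf or via ALTER SYSTEM, followed by a restart).
ALTER SYSTEM SET wal_level = 'logical';

-- Publish the tables you want to replicate (names are illustrative).
CREATE PUBLICATION my_publication FOR TABLE public.orders, public.customers;
```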
## Running Tests

To run the test suite:

```bash
cargo test --all-features
```
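The repository is a Cargo workspace with several crates, so you can also scope a run to a single crate with Cargo's package filter (crate names taken from the layout referenced in this README):

```bash
# Run only the core crate's tests
cargo test -p etl --all-features

# Run only the destination implementations' tests
cargo test -p etl-destinations --all-features
```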
## Docker

The repository includes Docker support for both the `replicator` and `api` components:

```bash
# Build replicator image
docker build -f ./etl-replicator/Dockerfile .

# Build api image
docker build -f ./etl-api/Dockerfile .
```
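If you intend to run what you build, it helps to tag the images at build time. The tag names below are arbitrary, and any runtime configuration (connection strings, credentials) is omitted because it depends on how you deploy the components:

```bash
# Build and tag the images (tag names are arbitrary)
docker build -f ./etl-replicator/Dockerfile -t etl-replicator:local .
docker build -f ./etl-api/Dockerfile -t etl-api:local .

# List the resulting images
docker image ls | grep etl
```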
## Architecture

For a detailed explanation of the ETL architecture and design decisions, please refer to our Design Document.
## Troubleshooting

If you see the following error when running tests on macOS:

```text
called `Result::unwrap()` on an `Err` value: Os { code: 24, kind: Uncategorized, message: "Too many open files" }
```

raise the limit of open files per process with:

```bash
ulimit -n 10000
```
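The change only applies to the current shell session, so run it in the same terminal you use for `cargo test`. You can check the limit currently in effect with:

```bash
# Show the per-process open-file limit for this shell
ulimit -n
```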
Currently, the system parallelizes the copying of different tables, but each individual table is still copied in sequential batches. This limits performance for large tables. We plan to address this once the ETL system reaches greater stability.
## License

Distributed under the Apache-2.0 License. See `LICENSE` for more information.