Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments
Luke Rowe1,2,6, Roger Girgis1,3,6, Anthony Gosselin1,3, Liam Paull1,2,5, Christopher Pal1,2,3,5, Felix Heide4,6
1 Mila, 2 Université de Montréal, 3 Polytechnique Montréal, 4 Princeton University, 5 CIFAR AI Chair, 6 Torc Robotics
Computer Vision and Pattern Recognition (CVPR), 2025
We propose Scenario Dreamer, a fully data-driven closed-loop generative simulator for autonomous vehicle planning.
scenario_dreamer.mp4
- [06/11/2025] Environment setup
- [06/11/2025] Dataset Preprocessing
- [07/21/2025] Train Scenario Dreamer autoencoder model on Waymo and nuPlan
- [07/21/2025] Train Scenario Dreamer latent diffusion model on Waymo and nuPlan
- [07/21/2025] Support generation of Scenario Dreamer initial scenes
- [07/21/2025] Support visualization of Scenario Dreamer initial scenes
- [07/21/2025] Support computing evaluation metrics
- [07/21/2025] Release of pre-trained Scenario Dreamer checkpoints
- [07/21/2025] Support lane-conditioned object generation
- [ETA: 08/31/2025] Support inpainting generation mode
- [ETA: 08/31/2025] Support generation of large simulation environments
- [ETA: 08/31/2025] Train CtRL-Sim behaviour model on Waymo
- [ETA: 08/31/2025] Train Scenario-Dreamer compatible agents in GPUDrive
- [ETA: 08/31/2025] Evaluate planners in Scenario Dreamer environments
- [ETA: 08/31/2025] SLEDGE baseline reproduction and evaluation
- Setup
- Waymo Dataset Preparation
- nuPlan Dataset Preparation
- Pre-Trained Checkpoints
- Training
- Evaluation
- Citation
- Acknowledgements
Start by cloning the repository:
git clone https://github.com/princeton-computational-imaging/scenario-dreamer.git
cd scenario-dreamer
This repository assumes you have a "scratch" directory for larger files (datasets, checkpoints, etc.). If disk space is not an issue, you can keep everything in the repository directory:
export SCRATCH_ROOT=$(pwd) # prefer a separate drive? Point SCRATCH_ROOT there instead.
Define environment variables to let the code know where things live:
source $(pwd)/scripts/define_env_variables.sh
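To confirm the variables were picked up, you can echo them (this assumes define_env_variables.sh exports SCRATCH_ROOT and PROJECT_ROOT, both of which are referenced later in this README):
echo $SCRATCH_ROOT    # larger files: datasets, checkpoints, cached latents
echo $PROJECT_ROOT    # repository root: metadata and visualization outputs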
# create conda environment
conda env create -f environment.yml
conda activate scenario-dreamer
# login to wandb for experiment logging
export WANDB_API_KEY=<your_api_key>
wandb login
Quick Option:
If you'd prefer to skip data extraction and preprocessing, you can download the prepared file from Google Drive instead. Place the following tar file in your scratch directory and extract it:
- scenario_dreamer_ae_preprocess_waymo.tar (preprocessed dataset for Scenario Dreamer autoencoder training on Waymo)
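For example, assuming the tar file has already been downloaded into your scratch directory:
cd $SCRATCH_ROOT
tar -xf scenario_dreamer_ae_preprocess_waymo.tar  # unpacks the preprocessed autoencoder training data in place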
Download the Waymo Open Motion Dataset (v1.1.0) into your scratch directory with the following directory structure:
$SCRATCH_ROOT/waymo_open_dataset_motion_v_1_1_0/
├── training/
│ ├── training.tfrecord-00000-of-01000
│ ├── …
│ └── training.tfrecord-00999-of-01000
├── validation/
│ ├── validation.tfrecord-00000-of-00150
│ ├── …
│ └── validation.tfrecord-00149-of-00150
└── testing/
├── testing.tfrecord-00000-of-00150
├── …
└── testing.tfrecord-00149-of-00150
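As an optional sanity check before preprocessing (assuming the layout above), you can verify the shard counts:
ls $SCRATCH_ROOT/waymo_open_dataset_motion_v_1_1_0/training | wc -l    # expect 1000
ls $SCRATCH_ROOT/waymo_open_dataset_motion_v_1_1_0/validation | wc -l  # expect 150
ls $SCRATCH_ROOT/waymo_open_dataset_motion_v_1_1_0/testing | wc -l     # expect 150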
Then, preprocess the Waymo dataset to prepare for Scenario Dreamer model training. Each of the two scripts takes ~12 h (8 CPU cores, 64 GB RAM):
bash scripts/extract_waymo_data.sh # extract relevant data from tfrecords and create train/val/test splits
bash scripts/preprocess_waymo_dataset.sh # preprocess data to facilitate efficient model training
Quick Option:
If you'd prefer to skip data extraction and preprocessing, you can download the prepared files from Google Drive instead. Place the following tar files in your scratch directory and extract them:
- scenario_dreamer_nuplan.tar (processed nuPlan data; required for computing metrics, but not for training)
- scenario_dreamer_ae_preprocess_nuplan.tar (preprocessed dataset for Scenario Dreamer autoencoder training on nuPlan)
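As with the Waymo quick option, assuming both tar files have been downloaded into your scratch directory:
cd $SCRATCH_ROOT
tar -xf scenario_dreamer_nuplan.tar                # processed nuPlan data for metrics
tar -xf scenario_dreamer_ae_preprocess_nuplan.tar  # preprocessed autoencoder training data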
We use the same extracted nuPlan data as SLEDGE, with minor modifications tailored to Scenario Dreamer. Our modified fork for extracting the nuPlan data is available here.
1. Install dependencies & download raw nuPlan data

Follow the guide in the installation.md file of our forked repo. This will walk you through:
- Downloading the nuPlan dataset
- Setting up the correct environment variables
- Installing the sledge-devkit

2. Extract nuPlan data

Use the instructions under "1. Feature Caching" in autoencoder.md to preprocess the nuPlan data.

3. Extract train/val/test splits and preprocess data for training

Run the following to extract train/val/test splits and create the preprocessed data for training:

bash scripts/extract_nuplan_data.sh # create train/val/test splits and create eval set for computing metrics
bash scripts/preprocess_nuplan_dataset.sh # preprocess data to facilitate efficient model training
Pre-trained checkpoints can be downloaded from Google Drive. Place the checkpoints directory into your scratch directory ($SCRATCH_ROOT).
To download all checkpoints into your scratch directory, run:
cd $SCRATCH_ROOT
gdown --folder https://drive.google.com/drive/folders/1G9jUA_wgF2Vo40I5HckO1yxUjA_0kUEJ
| Model | Dataset | Size | SHA-256 |
|---|---|---|---|
| Autoencoder | Waymo | 362 MB | 3c3033a107de727ca1c2399a8e0df107e5eb1a84bce3d7e18cc2e01698ccf6ac |
| LDM Large | Waymo | 12.4 GB | 06a1a65e9949f55c3398aeadacde388b03a6705f2661bc273cf43e7319de4cd5 |
| Autoencoder | nuPlan | 371 MB | 386b1f89eda71c5cdf6d29f7c343293e1a74bbd09395bfdeab6c2fb57f43e258 |
| LDM Large | nuPlan | 12.5 GB | 2151e59307282e29b456ffc7338b9ece92fc2e2cf22ef93a67929da3176b5c59 |
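To guard against corrupted downloads, you can compare file digests against the table above. A minimal sketch, assuming the checkpoint files use a .ckpt extension (the exact filenames inside the checkpoints directory may differ):
cd $SCRATCH_ROOT/checkpoints
find . -name "*.ckpt" -exec sha256sum {} \;  # compare each digest to the SHA-256 column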
Note: The LDM Large checkpoints were trained for 250k steps. While the Scenario Dreamer paper reports results at 165k steps, training to 250k steps improves most metrics. For this reason, we release the 250k-step checkpoints; their expected results are marginally better than those reported in the paper.
Expected Performance
Scenario Dreamer L Waymo
| Lane metrics | Value | Agent metrics | Value |
|---|---|---|---|
| route_length_mean (m) | 38.80 | nearest_dist_jsd | 0.05 |
| route_length_std (m) | 13.56 | lat_dev_jsd | 0.03 |
| endpoint_dist_mean (m) | 0.21 | ang_dev_jsd | 0.08 |
| endpoint_dist_std (m) | 0.81 | length_jsd | 0.43 |
| frechet_connectivity | 0.10 | width_jsd | 0.29 |
| frechet_density | 0.26 | speed_jsd | 0.38 |
| frechet_reach | 0.26 | collision_rate (%) | 4.01 |
| frechet_convenience | 1.29 | | |
Scenario Dreamer L nuPlan
| Lane metrics | Value | Agent metrics | Value |
|---|---|---|---|
| route_length_mean (m) | 36.68 | nearest_dist_jsd | 0.08 |
| route_length_std (m) | 10.39 | lat_dev_jsd | 0.10 |
| endpoint_dist_mean (m) | 0.25 | ang_dev_jsd | 0.11 |
| endpoint_dist_std (m) | 0.71 | length_jsd | 0.25 |
| frechet_connectivity | 0.08 | width_jsd | 0.20 |
| frechet_density | 0.25 | speed_jsd | 0.06 |
| frechet_reach | 0.05 | collision_rate (%) | 9.22 |
| frechet_convenience | 0.40 | | |
1. Prerequisites
- Verify that you have the preprocessed dataset (scenario_dreamer_ae_preprocess_[waymo|nuplan]) and that it resides in your scratch directory.
2. Launch Autoencoder Training
python train.py \
dataset_name=[waymo|nuplan] \
model_name=autoencoder \
ae.train.run_name=[your_autoencoder_run_name] \
ae.train.track=True
By default, ae.train.run_name is set to scenario_dreamer_autoencoder_[waymo|nuplan].
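For example, a concrete invocation for training the autoencoder on Waymo with the default run name:
python train.py \
  dataset_name=waymo \
  model_name=autoencoder \
  ae.train.run_name=scenario_dreamer_autoencoder_waymo \
  ae.train.track=True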
3. What to Expect
- Trains on 1 GPU (≈ 36-40 h on an A100 GPU).
- Training metrics and visualizations are logged to Weights & Biases (W&B).
- After each epoch, a single checkpoint (overwritten to last.ckpt) is saved to $SCRATCH_ROOT/checkpoints/[your_autoencoder_run_name].
1. Prerequisites
- Verify that you have the preprocessed dataset (scenario_dreamer_ae_preprocess_[waymo|nuplan]) and a trained autoencoder from the previous step.
2. Launch Caching
python eval.py \
dataset_name=[waymo|nuplan] \
model_name=autoencoder \
ae.eval.run_name=[your_autoencoder_run_name] \
ae.eval.cache_latents.enable_caching=True \
ae.eval.cache_latents.split_name=[train|val|test]
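LDM training below requires latents for both the train and val splits, so in practice you will run this once per split. For example, on Waymo with the default autoencoder run name:
for split in train val; do
  python eval.py \
    dataset_name=waymo \
    model_name=autoencoder \
    ae.eval.run_name=scenario_dreamer_autoencoder_waymo \
    ae.eval.cache_latents.enable_caching=True \
    ae.eval.cache_latents.split_name=$split
done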
3. What to Expect
- Caches latents (mean/log_var) to disk at $SCRATCH_ROOT/scenario_dreamer_autoencoder_latents_[waymo|nuplan]/[train|val|test] for LDM training.
- Utilizes 1 GPU (≈ 1 h on an A100 GPU).
1. Prerequisites
- Verify that you have the cached latents (scenario_dreamer_autoencoder_latents_[waymo|nuplan]) for the train and val splits in your scratch directory, along with the corresponding trained autoencoder.
2. Launch Training
Scenario Dreamer Base
By default, train.py trains a Scenario Dreamer Base model:
python train.py \
dataset_name=[waymo|nuplan] \
model_name=ldm \
ldm.model.autoencoder_run_name=[your_autoencoder_run_name] \
ldm.train.run_name=[your_ldm_run_name] \
ldm.train.track=True
- Ensure your LDM run name differs from your autoencoder run name. By default, ldm.train.run_name is set to scenario_dreamer_ldm_base_[waymo|nuplan].
Scenario Dreamer Large
python train.py \
dataset_name=[waymo|nuplan] \
model_name=ldm \
ldm.model.autoencoder_run_name=[your_autoencoder_run_name] \
ldm.train.run_name=[your_ldm_run_name] \
ldm.train.num_devices=8 \
ldm.datamodule.train_batch_size=128 \
ldm.datamodule.val_batch_size=128 \
ldm.model.num_l2l_blocks=3 \
ldm.train.track=True
- Ensure your LDM run name differs from your autoencoder run name. By default, ldm.train.run_name is set to scenario_dreamer_ldm_large_[waymo|nuplan].
3. What to Expect
- Scenario Dreamer B trains on 4 GPUs (≈ 24 h with 4 A100-L GPUs) and Scenario Dreamer L trains on 8 GPUs (≈ 32-36 h with 8 A100-L GPUs).
- By default, both models train for 165k steps.
- Training metrics and visualizations are logged to Weights & Biases (W&B).
- After each epoch, a single checkpoint (overwritten to last.ckpt) is saved to $SCRATCH_ROOT/checkpoints/[your_ldm_run_name].
- To resume training from an existing checkpoint, run the same training command; the code automatically resumes from the last.ckpt stored in the run's $SCRATCH_ROOT/checkpoints/[your_ldm_run_name] directory.
1. Prerequisites
- Verify that you have the preprocessed dataset (scenario_dreamer_ae_preprocess_[waymo|nuplan]) and a trained autoencoder.
2. Launch Eval
python eval.py \
dataset_name=[waymo|nuplan] \
model_name=autoencoder \
ae.eval.run_name=[your_autoencoder_run_name]
3. What to Expect
- By default, 50 reconstructed scenes will be visualized and logged to $PROJECT_ROOT/viz_eval_[your_autoencoder_run_name].
- The reconstruction metrics computed on the full test set will be printed.
Initial Scene Generation
1. Prerequisites
- Verify that you have a trained autoencoder and LDM.
2. Generate and Visualize Samples
To generate and visualize 100 initial scenes from your trained model:
python eval.py \
dataset_name=[waymo|nuplan] \
model_name=ldm \
ldm.eval.mode=initial_scene \
ldm.model.num_l2l_blocks=[1|3] \ # base model has 1 l2l block, large model has 3
ldm.eval.run_name=[your_ldm_run_name] \
ldm.model.autoencoder_run_name=[your_autoencoder_run_name] \
ldm.eval.num_samples=100 \
ldm.eval.visualize=True
To additionally cache the samples to disk for metrics computation, set ldm.eval.cache_samples=True. You can adjust ldm.eval.num_samples to configure the number of samples generated.
3. What to Expect
- 100 samples will be generated on 1 GPU with a default batch size of 32.
- The samples will be visualized to $PROJECT_ROOT/viz_gen_samples_[your_ldm_run_name].
- If you toggle ldm.eval.cache_samples=True, samples will be cached to $SCRATCH_ROOT/checkpoints/[your_ldm_run_name]/samples.
Lane-conditioned Object Generation
1. Prerequisites
- Verify that you have a trained autoencoder and LDM.
- Verify that you have the cached latents (scenario_dreamer_autoencoder_latents_[waymo|nuplan]) for the train and val splits in your scratch directory. We condition the reverse diffusion process on lane latents loaded from this cache for lane-conditioned generation.
2. Generate and Visualize Samples
To generate and visualize 100 lane-conditioned scenes from your trained model:
python eval.py \
dataset_name=[waymo|nuplan] \
model_name=ldm \
ldm.eval.mode=lane_conditioned \
ldm.model.num_l2l_blocks=[1|3] \ # base model has 1 l2l block, large model has 3
ldm.eval.run_name=[your_ldm_run_name] \
ldm.model.autoencoder_run_name=[your_autoencoder_run_name] \
ldm.eval.conditioning_path=${SCRATCH_ROOT}/scenario_dreamer_autoencoder_latents_[waymo|nuplan]/val \
ldm.eval.num_samples=100 \
ldm.eval.visualize=True
This will load lane latents from the validation set for conditioning. You can adjust ldm.eval.num_samples to configure the number of samples generated.
3. What to Expect
- 100 lane-conditioned samples will be generated on 1 GPU with a default batch size of 32.
- The lane-conditioned samples will be visualized to $PROJECT_ROOT/viz_gen_samples_[your_ldm_run_name].
Inpainting Generation
TBD
1. Prerequisites
- Verify that you have a trained autoencoder and LDM.
- You first need to generate 50k samples with your trained LDM:
python eval.py \
dataset_name=[waymo|nuplan] \
model_name=ldm \
ldm.eval.mode=initial_scene \
ldm.model.num_l2l_blocks=[1|3] \ # base model has 1 l2l block, large model has 3
ldm.eval.run_name=[your_ldm_run_name] \
ldm.model.autoencoder_run_name=[your_autoencoder_run_name] \
ldm.eval.num_samples=50000 \
ldm.eval.cache_samples=True
2. Compute Metrics
python eval.py \
dataset_name=[waymo|nuplan] \
model_name=ldm \
ldm.eval.mode=metrics \
ldm.eval.run_name=[your_ldm_run_name]
3. What to Expect
- Computes metrics using 50k generated scenes from your trained LDM and 50k real scenes whose paths are loaded from $PROJECT_ROOT/metadata/eval_set_[waymo|nuplan].pkl.
- Lane generation and agent generation metrics will be printed and written to $SCRATCH_ROOT/checkpoints/[your_ldm_run_name]/metrics.pkl.
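To inspect the saved metrics afterwards, a minimal sketch (this assumes metrics.pkl deserializes to a dictionary mapping metric names to values, which this README does not guarantee):
python -c "
import pickle
# shell expands \$SCRATCH_ROOT; substitute your actual run name for the placeholder
with open('$SCRATCH_ROOT/checkpoints/[your_ldm_run_name]/metrics.pkl', 'rb') as f:
    metrics = pickle.load(f)
for name, value in metrics.items():
    print(name, value)
"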
@InProceedings{rowe2025scenariodreamer,
  title     = {Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments},
  author    = {Rowe, Luke and Girgis, Roger and Gosselin, Anthony and Paull, Liam and Pal, Christopher and Heide, Felix},
  booktitle = {CVPR},
  year      = {2025}
}
Special thanks to the authors of the following open-source repositories: