ViniciusMikuni/FLEX
images/logo.png

FLEX: A Backbone for Diffusion Models

This project implements FLEX (FLow EXpert), a backbone architecture for diffusion models. FLEX is a hybrid architecture that combines convolutional ResNet layers with Transformer blocks embedded in a U-Net-style framework, optimized for tasks such as super-resolution and forecasting of spatio-temporal physical systems. It also supports calibrated uncertainty estimation via sampling and performs well even with as few as two reverse diffusion steps.

The following figure illustrates the overall architecture of FLEX, instantiated for super-resolution tasks. FLEX is modular and can be extended to forecasting and multi-task settings seamlessly. Here, FLEX operates in the residual space, rather than directly modeling raw data, which stabilizes training by reducing the variance of the diffusion velocity field.
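The variance-reduction effect of working in residual space can be illustrated with a toy numpy sketch (illustrative only; the field construction, 4x coarsening factor, and nearest-neighbor upsampling are assumptions, not the repo's actual data pipeline):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "physical" field: smooth large-scale structure plus small-scale noise.
x = np.linspace(0, 2 * np.pi, 64)
base = np.sin(x)[:, None] * np.cos(x)[None, :]
hr = base + 0.1 * rng.standard_normal((64, 64))      # high-res target

# 4x coarsened conditioning input, then a naive nearest-neighbor upsample.
lr = hr.reshape(16, 4, 16, 4).mean(axis=(1, 3))
upsampled = np.kron(lr, np.ones((4, 4)))

# Residual-space target: the diffusion model denoises r = x - upsample(c)
# rather than x itself. The residual carries only the fine-scale content,
# so its variance is much smaller than that of the raw field.
residual = hr - upsampled

print(f"var(x)        = {hr.var():.3f}")
print(f"var(residual) = {residual.var():.3f}")
```

A lower-variance target makes the velocity field the diffusion model must learn easier to fit, which is the stabilization effect described above.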

images/flex_sr.png

See our paper on arXiv (https://arxiv.org/abs/2505.17351) for full details.

Architectural Highlights

  • Hybrid U-Net Backbone:
    • Retains convolutional ResNet blocks for local spatial structure.
    • Replaces the U-Net bottleneck with a ViT (Vision Transformer) operating on patch size 1, enabling all-to-all communication without sacrificing spatial fidelity.
    • Uses a redesigned skip-connection scheme to integrate the ViT bottleneck with the convolutional layers, improving fine-scale reconstruction and long-range coherence.
  • Hierarchical Conditioning Strategy:
    • Task-specific encoder processes auxiliary inputs (e.g., coarse-resolution or past snapshots).
    • Weak conditioning injects partial features via skip connections, encouraging a more task-agnostic latent representation.
    • Strong conditioning injects full or learned embeddings into the decoder for task-specific guidance.
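The key idea behind the patch-size-1 ViT bottleneck can be sketched in a few lines of numpy (a minimal single-head self-attention over spatial positions; the actual repo presumably uses full Transformer blocks in PyTorch, so the function and weight names here are hypothetical):

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def attention_bottleneck(feat, Wq, Wk, Wv):
    """Single-head self-attention over spatial positions.

    With patch size 1, every spatial location of the coarsest feature map
    becomes one token, so each position can attend to every other one
    (all-to-all communication) without coarsening the feature map further.
    """
    C, H, W = feat.shape
    tokens = feat.reshape(C, H * W).T                # (H*W, C): one token per pixel
    q, k, v = tokens @ Wq, tokens @ Wk, tokens @ Wv
    attn = softmax(q @ k.T / np.sqrt(k.shape[-1]))   # (H*W, H*W) mixing weights
    out = attn @ v                                   # global information exchange
    return out.T.reshape(C, H, W)                    # back to a spatial map

rng = np.random.default_rng(0)
C, H, W = 8, 4, 4
feat = rng.standard_normal((C, H, W))                # coarsest U-Net feature map
Wq, Wk, Wv = (0.1 * rng.standard_normal((C, C)) for _ in range(3))
out = attention_bottleneck(feat, Wq, Wk, Wv)
print(out.shape)
```

Because the output has the same spatial shape as the input, this bottleneck can drop into a U-Net wherever a convolutional bottleneck would sit, preserving the skip-connection structure around it.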

Training Instructions

To train a new multi-task model for both super-resolution and forecasting:

python train_mt.py --run-name flex_small --dataset nskt --model flex --size small --data-dir PATH/TO/DATASET

Additional options are available for model sizes (small/medium/big) and model types (unet, uvit, flex).

To train a new single-task model for super-resolution, use:

python train_sr.py --run-name flex_sr_small --dataset nskt --model flex --size small --data-dir PATH/TO/DATASET

You can download data here: [ToDo].

Evaluation with Error Metrics

To evaluate a trained model, use the evaluation commands below. You can download pre-trained checkpoints here: [ToDo].

Forecasting

python evaluate.py --model-path checkpoints/checkpoint_name.pt --Reynolds-number 12000 --batch-size 32 --horizion 10 --diffusion-steps 2 --model flex --ensemb-size 1 --size small --data-dir PATH/TO/DATASET

Super-resolution

python evaluate.py --model-path checkpoints/checkpoint_name.pt --Reynolds-number 16000 --batch-size 32 --diffusion-steps 2 --model flex --ensemb-size 1 --size small --superres --data-dir PATH/TO/DATASET
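The `--ensemb-size` flag above presumably controls how many diffusion samples are drawn per input; the calibrated uncertainty estimation mentioned earlier then reduces to ensemble statistics over those samples. A minimal sketch (the `sample_prediction` stand-in is hypothetical; the real model would run the configured number of reverse diffusion steps conditioned on the input):

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_prediction(cond, seed):
    """Stand-in for one reverse-diffusion sample conditioned on `cond`.
    The real sampler would run `--diffusion-steps` denoising steps."""
    noise_rng = np.random.default_rng(seed)
    return cond + 0.05 * noise_rng.standard_normal(cond.shape)

cond = rng.standard_normal((64, 64))            # e.g. coarse input or past snapshot
ensemble = np.stack([sample_prediction(cond, s) for s in range(8)])

mean_pred = ensemble.mean(axis=0)               # point estimate
uncertainty = ensemble.std(axis=0)              # per-pixel predictive spread

print(mean_pred.shape, float(uncertainty.mean()))
```

Regions where the samples disagree get a large per-pixel standard deviation, which is what makes the ensemble spread usable as an uncertainty map.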

Citation

ToDo
