Grace Luo, Jonathan Granskog, Aleksander Holynski, Trevor Darrell
This repository contains the PyTorch implementation of Dual-Process Image Generation.
This code was tested with Python 3.10. To install the necessary packages, please run:
conda env create -f environment.yaml
conda activate dig
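Before running anything heavier, you may want to confirm that PyTorch can see your GPU(s). The snippet below is a generic sanity check, not part of the repository's code:

```python
# Generic sanity check that PyTorch sees your GPU(s) after installing the environment.
import torch

print("CUDA available:", torch.cuda.is_available())
for i in range(torch.cuda.device_count()):
    props = torch.cuda.get_device_properties(i)
    print(f"GPU {i}: {props.name}, {props.total_memory / 1024**3:.1f} GB")
```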
You can also run our method as a Gradio app with the following commands.
If you set the environment variable `OPENAI_API_KEY`, you can use the extra prompt and question expansion features we provide; leaving this variable unset disables these features.
conda activate dig
export OPENAI_API_KEY=<your_openai_api_key>
python3 launch_gradio.py
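The prompt and question expansion features call the OpenAI API. As a rough illustration only (this is not the repository's implementation; the model name and prompt are placeholders), expansion with the `openai` Python SDK might look like:

```python
# Illustrative sketch of prompt expansion via the OpenAI API.
# Not the repository's implementation; the model name and prompt are placeholders.
import os
from openai import OpenAI

# The OpenAI client reads OPENAI_API_KEY from the environment by default.
assert os.environ.get("OPENAI_API_KEY"), "Set OPENAI_API_KEY to enable the expansion features."

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[{
        "role": "user",
        "content": "Expand this image prompt with extra visual detail: 'a red bicycle'",
    }],
)
print(response.choices[0].message.content)
```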
You can also batch-optimize LoRAs and visualize their samples using the following script.
conda activate dig
./run_dual_process.sh
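Once optimized, a LoRA can be loaded into a Flux pipeline and sampled with `diffusers`. The sketch below is illustrative only (the LoRA path and prompt are placeholders, and the repository's own scripts may load and sample models differently):

```python
# Illustrative sketch: sampling from a Flux pipeline with an optimized LoRA attached.
# The LoRA path and prompt are placeholders; the repo's scripts may differ.
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16
).to("cuda")
pipe.load_lora_weights("path/to/optimized_lora")  # placeholder path

image = pipe(
    "a watercolor painting of a lighthouse",  # placeholder prompt
    num_inference_steps=4,  # Flux Schnell is designed for few-step sampling
    guidance_scale=0.0,
).images[0]
image.save("sample.png")
```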
This code was tested on a single 80GB Nvidia A100 GPU.
However, depending on your choice of image generator and VLM,
you can run this codebase on as little as two 24GB Nvidia RTX 4090 GPUs.
The table below lists the VRAM requirements for each supported model, measured with `torch.cuda.max_memory_reserved` (a minimal measurement sketch follows the table).
We also indicate whether each model is officially supported (experimental models have not been extensively tested).
| Model | Model Size | VRAM (GB) | Officially Supported |
|---|---|---|---|
| **Image Generator** | | | |
| Stable Diffusion v1.4 | 860M | 6.6 | ❌ |
| Sana | 1.6B | 15.0 | ❌ |
| Flux Schnell | 12B | 44.3 | ✅ |
| Flux Dev | 12B | 44.3 | ✅ |
| **VLM** | | | |
| Gemma3 | 4B | 12.1 | ❌ |
| LLaVA v1.5 | 7B | 16.7 | ❌ |
| Idefics2 | 8B | 17.3 | ✅ |
| Qwen2.5 VL | 7B | 19.6 | ✅ |
| Pixtral | 12B | 34.7 | ❌ |
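A minimal sketch of how such a peak-memory measurement can be taken with `torch.cuda.max_memory_reserved`. The two-GPU placement in the comments is illustrative: put the image generator on one card and the VLM on the other when a single card cannot hold both.

```python
# Minimal sketch of per-GPU peak-memory measurement with torch.cuda.max_memory_reserved.
import torch

# Reset the peak-memory counters on every visible GPU before running the workload.
for i in range(torch.cuda.device_count()):
    torch.cuda.reset_peak_memory_stats(i)

# ... load the image generator on "cuda:0" and the VLM on "cuda:1", then run the workload ...

# Report the peak reserved memory per GPU, in the same units as the table above (GB).
for i in range(torch.cuda.device_count()):
    peak_gb = torch.cuda.max_memory_reserved(i) / 1024**3
    print(f"GPU {i}: peak reserved memory {peak_gb:.1f} GB")
```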
@article{luo2025dualprocess,
title={Dual-Process Image Generation},
author={Grace Luo and Jonathan Granskog and Aleksander Holynski and Trevor Darrell},
journal={arXiv preprint arXiv:2506.01955},
year={2025}
}