CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
Hui Zhang, Dexiang Hong, Maoke Yang, Yutao Cheng, Zhao Zhang, Jie Shao, Xinglong Wu, Zuxuan Wu, and Yu-Gang Jiang
Fudan University & ByteDance Intelligent Creation.
CreatiDesign tackles the challenge of automated graphic design generation that requires precise control over multiple heterogeneous elements—primary visual elements (product images), secondary visual elements (decorative objects), and textual elements (slogans, titles). CreatiDesign introduces a unified multi-conditional diffusion transformer that achieves flexible and harmonious integration of diverse design elements with minimal architectural modifications.
- 🎨 Multi-Conditional Image Generation: Unified architecture supporting images, semantic layouts conditions simultaneously
- 🎯 Precise Element Control: Multimodal attention mask mechanism prevents condition interference
- 🗂️ Graphic Design Datasets: 400K graphic design samples with multi-condition annotations construced by automatic pipeline
- 📊 Comprehensive Benchmark: Rigorous evaluation of multi-subject preservation and semantic layout alignment.
- ✏️ Zero-Shot Editing: Natural extension to editing tasks without additional training or retraining
- Environment setup
conda create -n creatidesign python=3.10 -y
conda activate creatidesign
conda install pytorch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 pytorch-cuda=12.1 -c pytorch -c nvidia
- Requirements installation
pip install -r requirements.txt
Our CreatiDesign dataset contains 400K high-quality graphic design samples with comprehensive multi-condition annotations, constructed through our fully automated pipeline. The dataset covers diverse design categories including movie posters, product advertisements, brand promotions, and social media content.
Our comprehensive benchmark contains 1,000 carefully curated samples designed to rigorously evaluate graphic design generation capabilities across multiple dimensions. The benchmark assesses both fine-grained condition adherence and overall visual quality.
To evaluate the model's graphic design generation capabilities through our benchmark, follow these steps:
Generate images:
python test_creatidesign_benchmark.py
Evaluate multi-subject preservation:
python eval/subject.py
Evaluate semantic layout alignment:
python eval/layout.py
python eval/text.py
Multi-Conditional Graphic Design:
Model | Base model | Description |
---|---|---|
FLUX.1-dev | model used in the paper |
If you find our work useful for your research and applications, please kindly cite using this BibTeX:
@article{zhang2025creatidesign,
title={CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design},
author={Zhang, Hui and Hong, Dexiang and Yang, Maoke and Chen, Yutao and Zhang, Zhao and Shao, Jie and Wu, Xinglong and Wu, Zuxuan and Jiang, Yu-Gang},
journal={arXiv preprint arXiv:2505.19114},
year={2025}
}