GitHub - migucn/CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design
Hui Zhang, Dexiang Hong, Maoke Yang, Yutao Cheng, Zhao Zhang, Jie Shao, Xinglong Wu, Zuxuan Wu, and Yu-Gang Jiang
Fudan University & ByteDance Intelligent Creation.

🎯 Introduction

CreatiDesign tackles the challenge of automated graphic design generation that requires precise control over multiple heterogeneous elements—primary visual elements (product images), secondary visual elements (decorative objects), and textual elements (slogans, titles). CreatiDesign introduces a unified multi-conditional diffusion transformer that achieves flexible and harmonious integration of diverse design elements with minimal architectural modifications.

✨ Key Features

  • 🎨 Multi-Conditional Image Generation: Unified architecture that supports image and semantic layout conditions simultaneously
  • 🎯 Precise Element Control: Multimodal attention mask mechanism prevents interference between conditions
  • 🗂️ Graphic Design Datasets: 400K graphic design samples with multi-condition annotations, constructed by an automatic pipeline
  • 📊 Comprehensive Benchmark: Rigorous evaluation of multi-subject preservation and semantic layout alignment
  • ✏️ Zero-Shot Editing: Natural extension to editing tasks without additional training
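
As a rough illustration of the attention-mask idea in the feature list, the sketch below builds a block mask over concatenated token groups. The group names, the group sizes, and the visibility rule (each condition group attends only to itself, while the image tokens attend to every group) are assumptions made for illustration. They are not taken from the CreatiDesign code; the paper describes the actual design.

```python
from typing import Dict, List, Set


def build_block_mask(
    sizes: Dict[str, int], allowed: Dict[str, Set[str]]
) -> List[List[bool]]:
    """Return mask[q][k] == True where query token q may attend to key token k."""
    # Give each token group a contiguous index range, in insertion order.
    spans = {}
    start = 0
    for name, n in sizes.items():
        spans[name] = range(start, start + n)
        start += n
    total = start

    # Start fully masked, then open the permitted (query group, key group) blocks.
    mask = [[False] * total for _ in range(total)]
    for q_group, k_groups in allowed.items():
        for k_group in k_groups:
            for q in spans[q_group]:
                for k in spans[k_group]:
                    mask[q][k] = True
    return mask


# Hypothetical token groups and sizes, chosen only to keep the printout small.
sizes = {"image": 4, "subject": 2, "layout": 2}

# Assumed rule: conditions do not see each other; image tokens see everything.
allowed = {
    "image": {"image", "subject", "layout"},
    "subject": {"subject"},
    "layout": {"layout"},
}

mask = build_block_mask(sizes, allowed)
for row in mask:
    print("".join("1" if v else "0" for v in row))
```

Under this assumed rule, the subject and layout groups never exchange information directly, which is one simple way to keep conditions from interfering with each other.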

Quick Start

Setup

  1. Environment setup
conda create -n creatidesign python=3.10 -y
conda activate creatidesign
conda install pytorch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 pytorch-cuda=12.1 -c pytorch -c nvidia
  2. Requirements installation
pip install -r requirements.txt
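
After both steps, a quick check can confirm that the pinned versions landed in the active environment. The following is a minimal sketch using only the standard library. It assumes the PyTorch packages register their metadata under the distribution names `torch`, `torchvision`, and `torchaudio`; the helper names `installed_version` and `check_pins` are hypothetical and not part of this repository.

```python
from importlib import metadata
from typing import Dict, Optional

# Pins taken from the conda command above. The distribution name for
# PyTorch is assumed to be "torch".
EXPECTED = {"torch": "2.4.1", "torchvision": "0.19.1", "torchaudio": "2.4.1"}


def installed_version(dist: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist)
    except metadata.PackageNotFoundError:
        return None


def check_pins(expected: Dict[str, str]) -> Dict[str, str]:
    """Compare installed versions against the expected pins."""
    report = {}
    for dist, want in expected.items():
        have = installed_version(dist)
        if have is None:
            report[dist] = "missing"
        # Local build tags such as "2.4.1+cu121" still count as a match.
        elif have.split("+")[0] == want:
            report[dist] = "ok"
        else:
            report[dist] = f"mismatch (found {have}, expected {want})"
    return report


for dist, status in check_pins(EXPECTED).items():
    print(f"{dist}: {status}")
```

Run it inside the activated `creatidesign` environment; any line that does not end in `ok` points to a package worth reinstalling.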

Dataset and Benchmark

CreatiDesign Datasets HuggingFace

Our CreatiDesign dataset contains 400K high-quality graphic design samples with comprehensive multi-condition annotations, constructed through our fully automated pipeline. The dataset covers diverse design categories including movie posters, product advertisements, brand promotions, and social media content.

CreatiDesign Benchmark HuggingFace

Our comprehensive benchmark contains 1,000 carefully curated samples designed to rigorously evaluate graphic design generation capabilities across multiple dimensions. The benchmark assesses both fine-grained condition adherence and overall visual quality.

To evaluate the model's graphic design generation capabilities through our benchmark, follow these steps:

Generate images:

python test_creatidesign_benchmark.py

Evaluate multi-subject preservation:

python eval/subject.py

Evaluate semantic layout alignment:

python eval/layout.py
python eval/text.py
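
The steps above can also be chained in a small driver script. This is a sketch, not part of the repository: the script paths come from the commands listed above, while the names `STEPS`, `build_commands`, and `run_pipeline` are hypothetical. It assumes the scripts take no required arguments and are run from the repository root.

```python
import subprocess
import sys
from typing import List

# Script paths as listed in the steps above, in the order they are given.
# eval/text.py is grouped under layout alignment, following that listing.
STEPS = [
    ("generate images", "test_creatidesign_benchmark.py"),
    ("multi-subject preservation", "eval/subject.py"),
    ("semantic layout alignment", "eval/layout.py"),
    ("semantic layout alignment", "eval/text.py"),
]


def build_commands(python: str = sys.executable) -> List[List[str]]:
    """Build one argv list per step, using the given Python interpreter."""
    return [[python, script] for _, script in STEPS]


def run_pipeline(dry_run: bool = True) -> None:
    """Print each command; execute it as well when dry_run is False."""
    for (label, _), cmd in zip(STEPS, build_commands()):
        print(f"[{label}] {' '.join(cmd)}")
        if not dry_run:
            # check=True raises CalledProcessError and stops at the first failure.
            subprocess.run(cmd, check=True)


# Dry run by default: only shows what would be executed.
run_pipeline(dry_run=True)
```

Calling `run_pipeline(dry_run=False)` executes the steps in order, so evaluation only starts once image generation has finished without error.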

Models

Multi-Conditional Graphic Design:

Model | Base model | Description
HuggingFace | FLUX.1-dev | Model used in the paper

✒️ Citation

If you find our work useful for your research and applications, please kindly cite using this BibTeX:

@article{zhang2025creatidesign,
  title={CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design},
  author={Zhang, Hui and Hong, Dexiang and Yang, Maoke and Cheng, Yutao and Zhang, Zhao and Shao, Jie and Wu, Xinglong and Wu, Zuxuan and Jiang, Yu-Gang},
  journal={arXiv preprint arXiv:2505.19114},
  year={2025}
}
