Qualitative T2I comparison between vanilla conditional generation, GFT, and CFG on Stable Diffusion 1.5 with the prompt "Elegant crystal vase holding pink peonies, soft raindrops tracing paths down the window behind it".
GFT allows adjusting the sampling temperature of visual generation with only a single model.
GFT is a simple algorithm that allows you to remove CFG from visual generative models without ANY performance loss.
- GFT is highly efficient: simply finetune your pretrained model for about 1% of the pretraining time on the pretraining dataset.
- GFT requires minimal modifications (<10 lines of code) to existing codebases. Most design choices and hyperparameters are directly inherited from pretraining.
- GFT is highly universal. One algorithm fits ALL visual generative models, including diffusion, flow, autoregressive (AR), and masked models.
- If you like, GFT also enables training guidance-free models directly from scratch.
Try GFT once, enjoy 50% cheaper guidance-free sampling forever!
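To make the "50% cheaper" point concrete, here is a minimal, hypothetical PyTorch-style sketch contrasting one CFG denoising step (two forward passes per step) with one guidance-free step (a single forward pass). The `model(x_t, t, cond)` signature, the `null_cond` placeholder, and the particular CFG combination shown are illustrative assumptions, not the repository's actual API.

```python
import torch

@torch.no_grad()
def cfg_step(model, x_t, t, cond, null_cond, w):
    # Classifier-Free Guidance: every denoising step runs the network twice,
    # once conditionally and once unconditionally, then mixes the predictions
    # with guidance scale w (one common CFG convention).
    eps_cond = model(x_t, t, cond)
    eps_uncond = model(x_t, t, null_cond)
    return (1.0 + w) * eps_cond - w * eps_uncond

@torch.no_grad()
def guidance_free_step(model, x_t, t, cond):
    # A guidance-free (GFT-finetuned) model predicts the guided distribution
    # directly, so each step costs a single forward pass -- roughly half the
    # compute of CFG sampling.
    return model(x_t, t, cond)
```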
Check out details in our paper.
Comparison of the GFT and CFG methods. GFT shares CFG's training objective but uses a different parameterization of the conditional model, which enables direct training of an explicit sampling model.
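As a rough illustration of that parameterization difference, below is a hypothetical PyTorch-style training step for a noise-prediction diffusion model. It assumes the implicit conditional prediction is a linear mix of the sampling model's output and an unconditional output under a pseudo-temperature `beta` (mirroring the CFG relation solved for the conditional prediction, with `beta = 1 / (1 + w)`), trained with the ordinary denoising loss. The `model`, `scheduler`, and `null_cond` names, the stop-gradient on the unconditional branch, and the omission of temperature conditioning are simplifications in this sketch, not the exact released implementation.

```python
import torch
import torch.nn.functional as F

def gft_train_step(model, scheduler, x_0, cond, null_cond, beta):
    # Standard diffusion corruption: pick random timesteps and noise the clean data.
    t = torch.randint(0, scheduler.config.num_train_timesteps,
                      (x_0.shape[0],), device=x_0.device)
    noise = torch.randn_like(x_0)
    x_t = scheduler.add_noise(x_0, noise, t)

    # The network is treated as an explicit *sampling* model: it directly predicts
    # the guided noise for condition `cond`.
    eps_sample = model(x_t, t, cond)

    # Unconditional prediction (kept out of the gradient path in this sketch).
    with torch.no_grad():
        eps_uncond = model(x_t, t, null_cond)

    # Implicit conditional prediction: a linear mix controlled by the temperature
    # beta, i.e. eps_cond = beta * eps_sample + (1 - beta) * eps_uncond, which is
    # the CFG relation eps_sample = (1 + w) * eps_cond - w * eps_uncond solved for
    # eps_cond with beta = 1 / (1 + w).
    eps_cond = beta * eps_sample + (1.0 - beta) * eps_uncond

    # Same denoising objective as CFG training, applied to the implicit
    # conditional model instead of a directly parameterized one.
    return F.mse_loss(eps_cond, noise)
```

Since a single model is shown covering multiple sampling temperatures in the figure above, the actual method also exposes the temperature to the network (or an equivalent control) rather than fixing `beta` as done here; see the paper and the per-model directories for the exact formulation.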
🔥 News!!!
- [2025/05] We release training code and pretrained guidance-free checkpoints of DiT models.
- [2025/05] We release training code and pretrained guidance-free checkpoints of Stable Diffusion 1.5 models.
- [2025/03] We release training code and pretrained guidance-free checkpoints of LlamaGen models.
Please check out the implementation and example usage in each base model's respective directory to see how GFT can be easily integrated into existing visual generation codebases.
If you find our project helpful, please consider citing:
@article{chen2025visual,
  title={Visual Generation Without Guidance},
  author={Chen, Huayu and Jiang, Kai and Zheng, Kaiwen and Chen, Jianfei and Su, Hang and Zhu, Jun},
  journal={arXiv preprint arXiv:2501.15420},
  year={2025}
}