Xiaoyi Dong | Yuhang Zang | Yuhang Cao | Dahua Lin | Jiaqi Wang†
The Chinese University of Hong Kong, Shanghai Artificial Intelligence Laboratory
HiFlow is a training-free and model-agnostic framework to unlock the resolution potential of pre-trained flow models.
📖 Click for the full abstract of HiFlow
Text-to-image (T2I) diffusion/flow models have drawn considerable attention recently due to their remarkable ability to deliver flexible visual creations. Still, high-resolution image synthesis presents formidable challenges due to the scarcity and complexity of high-resolution content. To this end, we present HiFlow, a training-free and model-agnostic framework to unlock the resolution potential of pre-trained flow models. Specifically, HiFlow establishes a virtual reference flow within the high-resolution space that effectively captures the characteristics of low-resolution flow information, offering guidance for high-resolution generation through three key aspects: initialization alignment for low-frequency consistency, direction alignment for structure preservation, and acceleration alignment for detail fidelity. By leveraging this flow-aligned guidance, HiFlow substantially elevates the quality of high-resolution image synthesis of T2I models and demonstrates versatility across their personalized variants. Extensive experiments validate HiFlow's superiority in achieving superior high-resolution image quality over current state-of-the-art methods.
HiFlow constructs reference flow from low-resolution sampling trajectory to offer initiation alignment, direction alignment, and acceleration alignment, enabling flow-aligned high-resolution image generation. Specifically, HiFlow involves a cascade generation paradigm: First, a virtual reference flow is constructe 8000 d in the high-resolution space based on the step-wise estimated clean samples of the low-resolution sampling flow. Then, during high-resolution synthesizing, the reference flow offers guidance from sampling initialization, denoising direction, and moving acceleration, aiding in achieving consistent low-frequency patterns, preserving structural features, and maintaining high-fidelity details.
git clone https://github.com/Bujiazi/HiFlow.git
cd HiFlow
conda create -n hiflow python=3.10
conda activate hiflow
pip install -r requirements.txt
HiFlow can be seamlessly integrated with various LoRA models.
Model:aidmaFLUXPro (More and Finer Details)
Model:aidmaRealisticSkin (Realistic Skin)
Model:Black Myth Wukong (T2I Customization)
sh inference.sh
Model downloading is automatic.
- Support LoRA! (2025.5.11)
- Code (V1.0) and project page are released! (2025.4.17)
- Paper is available on arXiv! (2025.4.8)
- 🚀 Release the HiFlow code and project page
- 🚀 Release paper
If you find our work helpful, please consider giving a star ⭐ and citation 📝
@article{bu2025hiflow,
title={HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance},
author={Bu, Jiazi and Ling, Pengyang and Zhou, Yujie and Zhang, Pan and Wu, Tong and Dong, Xiaoyi and Zang, Yuhang and Cao, Yuhang and Lin, Dahua and Wang, Jiaqi},
journal={arXiv preprint arXiv:2504.06232},
year={2025}
}
This is official code of HiFlow. All the copyrights of the demo images and audio are from community users. Feel free to contact us if you would like remove them.
The code is built upon the below repositories, we thank all the contributors for open-sourcing.