SongGeneration:High-Quality Song Generation with Multi-Preference Alignment (SOTA),you can try VRAM>12G
- Test env(插件测试环境):window11,python3.11, torch2.6 ,cu124, VR12G,(transformers 4.45.1)
- 0622修复config加载问题
- 0623 修复无法二次推理的问题,解决omegaconf重复注册的问题
In the ./ComfyUI/custom_node directory, run the following:
git clone https://github.com/smthemex/ComfyUI_SongGeneration.git
- window平台最难装的就是fairseq库,python3.11的建议用轮子安装liyaodev/fairseq;
- 如果缺失库,打开requirements_orgin.txt文件,看是少了哪个,手动安装;
- The most difficult thing to install on the Windows platform is the Fairseq library. It is recommended to install it on wheels for version 3.11 liyaodev/fairseq;
- If the library is missing, open the ’requirements_orgin.txt‘ file and see which one is missing, then manually install it;
pip install -r requirements.txt
- 3.1.1 download ckpt from tencent/SongGeneration 国内建议魔搭AI-ModelScope/SongGeneration
- 3.1.2 download htdemucs.pth tencent/SongGeneration
- 文件结构如下:
-- ComfyUI/models/SongGeneration/
|-- htdemucs.pth #150M
|-- ckpt/ # 24.4G all 整个文件夹的大小
|--encode-s12k.pt # 3.68G
|--prompt.pt # 3M
|--model_1rvq/
|--all files # 全部文件
|--model_septoken/
|--all files # 全部文件
|--models--lengyue233--content-vec-best/
|--all files # 全部文件
|--songgeneration_base/ #注意删掉了_zh notice no ‘_zh’ now
|--all files # 全部文件
|--vae/
|--all files # 全部文件
@article{lei2025levo,
title={LeVo: High-Quality Song Generation with Multi-Preference Alignment},
author={Lei, Shun and Xu, Yaoxun and Lin, Zhiwei and Zhang, Huaicheng and Tan, Wei and Chen, Hangting and Yu, Jianwei and Zhang, Yixuan and Yang, Chenyu and Zhu, Haina and Wang, Shuai and Wu, Zhiyong and Yu, Dong},
journal={arXiv preprint arXiv:2506.07520},
year={2025}
}