8000 GitHub - smthemex/ComfyUI_SongGeneration: SongGeneration:High-Quality Song Generation with Multi-Preference Alignment (SOTA),you can try VRAM>12G
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

smthemex/ComfyUI_SongGeneration

Repository files navigation

ComfyUI_SongGeneration

SongGeneration:High-Quality Song Generation with Multi-Preference Alignment (SOTA),you can try VRAM>12G

Tips

  • Test env(插件测试环境):window11,python3.11, torch2.6 ,cu124, VR12G,(transformers 4.45.1)
  • 0622修复config加载问题
  • 0623 修复无法二次推理的问题,解决omegaconf重复注册的问题

1. Installation

In the ./ComfyUI/custom_node directory, run the following:

git clone https://github.com/smthemex/ComfyUI_SongGeneration.git

2. Requirements

  • window平台最难装的就是fairseq库,python3.11的建议用轮子安装liyaodev/fairseq
  • 如果缺失库,打开requirements_orgin.txt文件,看是少了哪个,手动安装;
  • The most difficult thing to install on the Windows platform is the Fairseq library. It is recommended to install it on wheels for version 3.11 liyaodev/fairseq
  • If the library is missing, open the ’requirements_orgin.txt‘ file and see which one is missing, then manually install it;
pip install -r requirements.txt

3.Model

--  ComfyUI/models/SongGeneration/
    |-- htdemucs.pth #150M
    |-- ckpt/  # 24.4G all 整个文件夹的大小
        |--encode-s12k.pt  # 3.68G
        |--prompt.pt  # 3M
        |--model_1rvq/  
            |--all files  # 全部文件 
        |--model_septoken/
            |--all files  # 全部文件
        |--models--lengyue233--content-vec-best/
            |--all files  # 全部文件
        |--songgeneration_base/ #注意删掉了_zh  notice  no ‘_zh’ now
            |--all files  # 全部文件
        |--vae/
            |--all files  # 全部文件

4 Example

5 Citation

@article{lei2025levo,
  title={LeVo: High-Quality Song Generation with Multi-Preference Alignment},
  author={Lei, Shun and Xu, Yaoxun and Lin, Zhiwei and Zhang, Huaicheng and Tan, Wei and Chen, Hangting and Yu, Jianwei and Zhang, Yixuan and Yang, Chenyu and Zhu, Haina and Wang, Shuai and Wu, Zhiyong and Yu, Dong},
  journal={arXiv preprint arXiv:2506.07520},
  year={2025}
}

About

SongGeneration:High-Quality Song Generation with Multi-Preference Alignment (SOTA),you can try VRAM>12G

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

0