Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
A song aesthetic evaluation toolkit trained on SongEval.
Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 BibTeX 来增强 Overleaf。
ACE-Step: A Step Towards Music Generation Foundation Model
Code repository of our research paper - D. Afchar, G. Meseguer Brocal, R. Hennequin
PyTorch code and models for V-JEPA self-supervised learning from video.
[Support 0.49.x](Reset Cursor AI MachineID & Bypass Higher Token Limit) Cursor Ai ,自动重置机器ID , 免费升级使用Pro功能: You've reached your trial request limit. / Too many free trial accounts used on this machi…
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
Acceptance rates for the major AI conferences
Implementation of all RL algorithms in a simpler way
YouTube Music Desktop App bundled with custom plugins (and built-in ad blocker / downloader)
MUSDB25 - A Fully Multitrack Dataset for Music Source Separation
PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.
Train your AI self, amplify you, bridge the world
Self-supervised learning for fast pitch estimation
PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
Hearing loss simulation VST plugin
Minimal reproduction of DeepSeek R1-Zero
A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.
The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)
A simple yet effective Audio-to-Midi Automatic Piano Transcription system
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models [NAACL 2025]
PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis
Teaching material for the course "Deep Learning for Music Analysis and Generation" I taught at National Taiwan University (2023 Fall)
Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks