More
Lists (1)
Sort Name ascending (A-Z)
Stars
ACE-Step: A Step Towards Music Generation Foundation Model
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
A song aesthetic evaluation toolkit trained on SongEval.
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Perforator is a cluster-wide continuous profiling tool designed for large data centers
Awesome speech/audio LLMs, representation learning, and codec models
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
Reference-aware automatic speech evaluation toolkit
A developer's guide to management: an open-sourced handbook for leading software engineering teams.
The strictest and most opinionated python linter ever!
A Unified Library for Parameter-Efficient and Modular Transfer Learning
Foundational Models for State-of-the-Art Speech and Text Translation
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Machine Learning Engineering Open Book
A high-throughput and memory-efficient inference and serving engine for LLMs
Noise supression using deep filtering
A timeline of the latest AI models for audio generation, starting in 2023!
Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
🔊 Text-Prompted Generative Audio Model
AudioLDM: Generate speech, sound effects, music and beyond, with text.