Stars
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis", Interspeech 2023
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
Awesome-LLM: a curated list of Large Language Model
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNโฆ
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Audio generation using diffusion models, in PyTorch.
Instructional implementation of Physics-Aware Training (PAT) with demonstrations on simulated experiments.
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
๐ Search for YouTube videos, channels & playlists. Get ๐ video & ๐ playlist info using link. Get search suggestions. WITHOUT YouTube Data API v3.
Advanced AppleTV Web Browser (uses Private API)
CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)