Starred repositories
An AI agent powered by LLMs that streamlines the entire process of data analysis. 🚀
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
This package contains the original 2012 AlexNet code.
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
No fortress, purely open ground. OpenManus is Coming.
A live stream development of RL tuning for LLM agents
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
A generative speech model for daily dialogue.
Best-practice TTS based on BERT and VITS with some Natural Speech features from Microsoft; supports ONNX streaming output!
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…
Utilize the unlimited free GPT-3.5-Turbo API service provided by the login-free ChatGPT Web.
Awesome speech/audio LLMs, representation learning, and codec models
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
✅ Solutions to LeetCode in Go, 100% test coverage, runtime beats 100%
huangxu1991 / GPT-SoVITS-VC
Forked from RVC-Boss/GPT-SoVITS
VC Without Retrain!
Easily train a good VC model with voice data <= 10 mins!
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/
VITS2 for Chinese speech | Latest VITS2 Chinese speech synthesis
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
vits2 backbone with multilingual-bert
unofficial vits2-TTS implementation in pytorch
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…
The code for the bark-voicecloning model. Training and inference.
SoftVC VITS Singing Voice Conversion