Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
Updated Aug 3, 2024
🇺🇦 Open Source Ukrainian Text-to-Speech datasets
🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model
A voice-based AI chat interface built with Next.js and ElevenLabs. Start and stop real-time conversations with an animated UI that reflects agent status. Fully responsive and deployable via Vercel with environment-based agent configuration.