🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
-
Updated
Aug 16, 2024 - Python
8000
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Ultrafast GAN based Vocoder for Text to Speech
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
zero-shot realtime TTS system, fully offline, free and open source
TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format
SA-toolkit: Speaker speech anonymization toolkit in python
RADTTS + HiFiGAN vocoder
homework for deep generation. Combine FastSpeech2 with different vocoders ⭐REFERENCE (modify origin repos): https://github.com/ming024/FastSpeech2 https://github.com/NVIDIA/waveglow https://github.com/mindslab-ai/univnet https://github.com/jik876/hifi-gan
Add a description, image, and links to the hifigan topic page so that developers can more easily learn about it.
To associate your repository with the hifigan topic, visit your repo's landing page and select "manage topics."