8000 multi-speaker · GitHub Topics · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
#

multi-speaker

Here are 16 public repositories matching this topic...

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

  • Updated Sep 24, 2022
  • Python

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

  • Updated Jun 6, 2022
  • Python

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

  • Updated Jul 31, 2023
  • Python

🎵 Complete offline audio transcription system with speaker diarization using OpenAI Whisper and PyAnnote. Features automatic audio cleaning, precise timestamps, multiple output formats (JSON/TXT/Markdown), and support for 20+ audio formats. No external APIs required - works entirely offline.

  • Updated Jun 4, 2025
  • Python

Improve this page

Add a description, image, and links to the multi-speaker topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-speaker topic, visit your repo's landing page and select "manage topics."

Learn more

0