API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.

Python 447 65 Updated Oct 23, 2024

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,085 180 Updated Jun 6, 2025

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,319 5,256 Updated Nov 15, 2024

X-T-E-R / GPT-SoVITS-Inference

Forked from RVC-Boss/GPT-SoVITS

Inference Specialization

Python 457 30 Updated Jun 25, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 14,355 1,501 Updated Jun 2, 2025

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,639 886 Updated Jun 2, 2025

espressif / esp-skainet

Espressif intelligent voice assistant

C 714 159 Updated May 27, 2025

wiseman / py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

C 2,257 418 Updated Jul 4, 2024

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 6,236 714 Updated Jun 5, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 36,582 3,957 Updated May 23, 2025

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,949 4,503 Updated Aug 19, 2024

HumeAI / hume-api-examples

Example projects built with the Hume AI APIs

Jupyter Notebook 199 100 Updated Jun 4, 2025

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,947 363 Updated Jan 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Finn 0x5446

Achievements

Achievements

Block or report 0x5446

Starred repositories

0x5446 / async_cosyvoice

csukuangfj / onnxruntime-libs

nari-labs / dia

qi-hua / async_cosyvoice

hexisyztem / CosyVoice

canopyai / Orpheus-TTS

SWivid / F5-TTS

WGS-note / F5_TTS_Faster

RVC-Boss / GPT-SoVITS

DakeQQ / Voice-Activity-Detection-VAD-ONNX

KoljaB / RealtimeTTS

SesameAILabs / csm

sepfy / libpeer

espressif / esp-webrtc-solution

mem0ai / mem0

pengzhendong / streaming-sensevoice

hlt-mt / mosel

0x5446 / api4sensevoice