8000 0x5446 (Finn) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View 0x5446's full-sized avatar
  • t1ger

Block or report 0x5446

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

使用vllm加速cosyvoice2的推理

Jupyter Notebook 1 Updated Apr 30, 2025

onnxruntime pre-compiled libs

127 30 Updated May 16, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 16,733 1,342 Updated May 28, 2025

使用vllm加速cosyvoice2的推理

Jupyter Notebook 322 42 Updated Apr 26, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20 2 Updated Apr 16, 2025

Towards Human-Sounding Speech

Python 4,932 401 Updated May 6, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 12,185 1,737 Updated Jun 5, 2025

F5-TTS 推理加速,速度提升约4倍!

Python 92 14 Updated Jan 6, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 47,216 5,202 Updated Jun 6, 2025

Utilizes ONNX Runtime for speech activity detection.

Python 24 4 Updated May 7, 2025

Converts text to speech in realtime

Python 3,139 312 Updated May 14, 2025

A Conversational Speech Generation Model

Python 13,457 1,293 Updated May 27, 2025

WebRTC Library for IoT/Embedded Device using C

C 1,223 203 Updated May 29, 2025

Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.

Python 33,804 3,349 Updated Jun 5, 2025

Pseudo Streaming SenseVoice with Hotwords

Python 288 31 Updated Mar 13, 2025

Collection of Open Source Speech Data

158 6 Updated Nov 8, 2024

API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.

Python 447 65 Updated Oct 23, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,085 180 Updated Jun 6, 2025

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,319 5,256 Updated Nov 15, 2024

Inference Specialization

Python 457 30 Updated Jun 25, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 14,355 1,501 Updated Jun 2, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 7,639 886 Updated Jun 2, 2025

Espressif intelligent voice assistant

C 714 159 Updated May 27, 2025

Python interface to the WebRTC Voice Activity Detector

C 2,257 418 Updated Jul 4, 2024

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 6,236 714 Updated Jun 5, 2025

A generative speech model for daily dialogue.

Python 36,582 3,957 Updated May 23, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,949 4,503 Updated Aug 19, 2024

Example projects built with the Hume AI APIs

Jupyter Notebook 199 100 Updated Jun 4, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 2,947 363 Updated Jan 7, 2025
Next
0