zgldh's list / Audio ML · GitHub
Stars

Audio ML

TTS, STT
9 repositories
Python 1,116 342 Updated May 27, 2025

DNN based hotword and wake word detection toolkit (model generation included)

Python 465 139 Updated May 4, 2021

Classify audio with neural nets on embedded systems like the Raspberry Pi

Python 86 14 Updated Apr 10, 2024

Generate embedding vectors from audio files

Python 59 31 Updated May 7, 2025

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 856 62 Updated Dec 23, 2024

TTS with kokoro and onnx runtime

Python 2,020 196 Updated May 10, 2025

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python 2,927 418 Updated May 28, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,702 747 Updated Mar 5, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,959 1,915 Updated May 26, 2025