zhenhaoge / Starred · GitHub

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

CSS 43,881 5,493 Updated May 8, 2025

Lingvo

Python 2,844 450 Updated Jun 18, 2025

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

Python 793 195 Updated Apr 6, 2023

Self-contained, minimalistic implementation of diffusion models with PyTorch.

Python 1,074 138 Updated Jun 28, 2022

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,042 1,647 Updated Jul 2, 2025

A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,T…

Go 8,017 632 Updated Jul 6, 2025

StatQuest with Josh Starmer

HTML 70 37 Updated Sep 10, 2019

Python 470 82 Updated Jun 12, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 48,512 5,338 Updated Jul 2, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,817 763 Updated Mar 5, 2025

SOTA Open Source TTS

Python 22,246 1,822 Updated Jul 2, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 3,479 377 Updated Jun 30, 2025

A Python package that makes it easy for developers to create AI apps powered by various AI providers.

Python 1,624 199 Updated Apr 8, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,603 735 Updated Jul 7, 2025

Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM!

Python 8,237 615 Updated Jul 3, 2025

Fast TorToiSe inference (5x or your money back!)

Jupyter Notebook 827 176 Updated Jul 10, 2024