8000 yantaozhao (YantaoZhao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View yantaozhao's full-sized avatar
  • Beijing, China

Block or report yantaozhao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 13,684 987 Updated Apr 29, 2025

A Python library for temporal disaggregation of time series data

Jupyter Notebook 11 Updated Apr 21, 2025

Towards Human-Sounding Speech

Python 4,593 368 Updated Apr 16, 2025

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 1,202 160 Updated Apr 20, 2025

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 14,319 1,056 Updated Apr 26, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 15,242 1,650 Updated Apr 12, 2025

百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断

Python 1,175 208 Updated Mar 15, 2025

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 48,786 8,166 Updated Apr 29, 2025

🔎 📈 🐍 💰 Backtest trading strategies in Python.

Python 6,406 1,185 Updated Mar 30, 2025

A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,T…

Go 6,503 478 Updated Apr 30, 2025

nanomsg-next-generation -- light-weight brokerless messaging

C 4,073 506 Updated Apr 28, 2025

A generative speech model for daily dialogue.

Python 36,006 3,902 Updated Mar 14, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 92,191 11,687 Updated Apr 30, 2025

A JavaScript / TypeScript / Python / C# / PHP / Go cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges

Python 35,855 7,845 Updated Apr 30, 2025

Portfolio and risk analytics in Python

Jupyter Notebook 470 133 Updated Sep 26, 2024

A hyperparameter optimization framework

Python 11,850 1,090 Updated Apr 28, 2025

Zipline, a Pythonic Algorithmic Trading Library

Python 1,370 246 Updated Nov 25, 2024

Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on d…

JavaScript 729 41 Updated Dec 10, 2024

Undetected version of the Playwright testing and automation library.

JavaScript 711 12 Updated Apr 30, 2025

Details on how to get Binance public data

Python 1,832 517 Updated Jan 9, 2025

The QuantLib C++ library

C++ 5,913 1,903 Updated Apr 29, 2025

A book covering the fundamentals of data visualization

HTML 3,291 722 Updated Jul 27, 2022

Stable Diffusion web UI

Python 151,891 28,246 Updated Apr 29, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,013 706 Updated Apr 12, 2025

Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Python 285 9 Updated Mar 7, 2025

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

Python 49 5 Updated Apr 21, 2025

PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control

Python 334 39 Updated Apr 17, 2025

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,912 2,313 Updated Mar 13, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 39,889 14,954 Updated Apr 30, 2025
Next
0