Stars
A TTS model capable of generating ultra-realistic dialogue in one pass.
A Python library for temporal disaggregation of time series data
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
🔎 📈 🐍 💰 Backtest trading strategies in Python.
A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,T…
nanomsg-next-generation -- light-weight brokerless messaging
A generative speech model for daily dialogue.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A JavaScript / TypeScript / Python / C# / PHP / Go cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
stefan-jansen / pyfolio-reloaded
Forked from quantopian/pyfolioPortfolio and risk analytics in Python
stefan-jansen / zipline-reloaded
Forked from quantopian/ziplineZipline, a Pythonic Algorithmic Trading Library
Collection of patches for puppeteer and playwright to avoid automation detection and leaks. Helps to avoid Cloudflare and DataDome CAPTCHA pages. Easy to patch/unpatch, can be enabled/disabled on d…
Undetected version of the Playwright testing and automation library.
Details on how to get Binance public data
A book covering the fundamentals of data visualization
Stable Diffusion web UI
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows