Stars
A course on aligning smol models.
PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
WTF Solidity: a minimalist introductory tutorial for beginners. Now supports English! Official site: https://wtf.academy
An intuitive and low-overhead instrumentation tool for Python
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
LLM Frontend for Power Users.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Chat凉宫春日 (Chat Haruhi Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.
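🚀 QuickGo, direct access to external links: seamlessly and automatically skips the "security center" redirect restrictions on 50+ sites, including Zhihu, Jianshu, Juejin, CSDN, SSPAI, and Gitee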
Open CS Application | Open-source CS application guide
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
[AAAI 2025] Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding
UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts
Xiaomi Home Integration for Home Assistant
Ikaros-521 / AI-Vtuber
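Forked from sandboxdream/AI-Vtuber
AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or run directly on a local machine for chat…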
Live2D Library for Python (C Extension): Supports model loading, lip-sync, basic face rigging, and precise click test.