-
Wuhan University
- Wuhan University
-
12:16
(UTC +08:00)
Lists (3)
Sort Name ascending (A-Z)
Stars
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi
Attempt to create lipsync library for realtime use with three.js
Talk To AI with FastRTC enables natural, real-time voice conversations with AI using WebRTC, offering customizable voices, interfaces, and local or cloud-based API integration.
基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用
[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
Ikaros-521 / AI-Vtuber
Forked from sandboxdream/AI-VtuberAI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…
Open-source framework for conversational voice AI agents.
实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
GPT-4o-level, real-time spoken dialogue system.
Voice activity detector (VAD) for the browser with a simple API
On-device wake word detection powered by deep learning
Awesome Digital Human
A ThreeJS-powered virtual human being that uses a set of neat Azure APIs to do some talking!
Real time interactive streaming digital human
Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系
A React Framework for building internal tools, admin panels, dashboards & B2B apps with unmatched flexibility.
📱🚀 🧩 Cross Device & High Performance Normal Form/Dynamic(JSON Schema) Form/Form Builder -- Support React/React Native/Vue 2/Vue 3
🛠️ A flexible and extensible command line tool for OpenTiny and frontend.
Python based web automation tool. Powerful and elegant.