- Earth
Stars
Integrate the DeepSeek API into popular softwares
「妙幕」是一款跨平台客户端工具,可以批量为视频或者音频生成字幕文件,并支持对字幕进行翻译,支持百度、火山、openai、ollama、deepseek 等多家翻译
Smart Preview 是一个强大的浏览器扩展,旨在提升您的网页浏览体验。它允许用户快速预览链接内容,而无需打开新的标签页,同时提供智能的窗口管理和自定义设置选项。
Open-source framework and platform for building real-time, multimodal, low-latency conversational voice AI agents. It features a workflow builder and supports C, C++, Go, Python, JavaScript, and Ty…
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Let your Claude able to think
🔥 今日热榜 API,一个聚合热门数据的 API 接口,支持 RSS 模式 及 Vercel 部署 | 前端页面:https://github.com/imsyy/DailyHot
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Open and efficient video watermarking
Official implementation of the paper "Watermark Anything with Localized Messages"
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
Video encoding / transcoding / converting for node.js
Ikaros-521 / AI-Vtuber
Forked from sandboxdream/AI-VtuberAI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…
Real time interactive streaming digital human
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A Magisk/KernelSU module to enable 5G and VoLTE on Pixel 7 Series
Curated list of awesome Android apps making use of Shizuku
tetato / JavSP-Docker
Forked from Yuukiy/JavSP汇总多站点数据的AV元数据刮削器-Docker版
A generative speech model for daily dialogue.
Question and Answer based on Anything.