Lists (4)
Sort Name ascending (A-Z)
Stars
Using OpenAI's Whisper to automatically generate YouTube subtitles
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
猫抓 浏览器资源嗅探扩展 / cat-catch Browser Resource Sniffing Extension
🥽🖼️ XR Voice Call WebUI, Make AI-Powered characters appear to you.
为独立开发者准备的精选技术栈和工具仓库来了!这里有你最需要的工具,帮你提升开发效率、节约成本,最重要的是——这些工具都是市场上热门的,经过验证的。🚀A curated collection of tech stacks and tools tailored for independent developers is here! these are proven, popular tools …
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
一个Python + FastAPI + Playwright + Camoufox 中间层代理服务器,兼容 OpenAI API且支持参数设置、toolcall和ab测试模型等,通过将请求转发到 Google AI Studio 网页版的对话,并同样按照标准格式返回输出的工具。课余时间有限,随缘更新
Official repo for paper "Sparse Representation and Construction for High-Resolution 3D Shapes Modeling".
Various Dockerfiles I use on the desktop and on servers.
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
LLIA - Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models
[SIGGRAPH'25] SOAP: Style-Omniscient Animatable Portraits
基于deno的抖音视频图文无水印下载,支持cf worker,vercel,deno deploy,docker部署
一个在线的微信公众号文章批量下载工具,支持下载阅读量与评论数据,支持私有化部署,通过浏览器进行使用,无需进行安装
A collection of Three.js Shading Language (TSL) textures
Portfolio25 is an interactive portfolio built with Next.js, React.js, Three.js, Framer Motion, and TypeScript. It features a dynamic hero, 3D elements, canvas effects, and a fully responsive design…
A cartoon-style water effect with custom shaders, optimized for performance, built with React Three Fiber.
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
This Blender addon is aimed to help you integrate Cascadeur into your workflow.
[CSUR] A Survey on Video Diffusion Models
A powerful framework for building realtime voice AI agents 🤖🎙️📹
Gemini polling proxy service (gemini轮询代理服务)
Your AI Operator for Web, Android, Automation & Testing.