Stars
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
Chrome extension to view ChatGPT summaries alongside Google search results and YouTube videos, also supports Yahoo! ニュース、PubMed、PMC、NewsPicks、Github、Nikkei、 Bing、Google Patents, and any page summary.
Free, simple, fast interactive diagrams for any GitHub repository
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
💀 Mac app to block your own access to distracting websites etc for a predetermined period of time. It can not be undone by the app or by a restart – you must wait for the timer to run out.
A simple VITS HTTP API, developed by extending Moegoe with additional features.
No fortress, purely open ground. OpenManus is Coming.
AdGuard browser extension
Advanced player for set-top boxes and tvs running Android OS
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Daily updated repository for https://github.com/disposable/disposable
A list of disposable/temporary email address domains
Speech To Speech: an effort for an open-sourced and modular GPT4-o
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Generative Models by Stability AI
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
FFmpeg for browser, powered by WebAssembly
twmd: CLI/GUI Apiless twitter downlaoder. Download medias from single tweet or a whole profile.