Lists (13)
Sort Name ascending (A-Z)
Stars
AI design agent, local alternative for Lovart. AI agent with ability to design, edit and generate images, posters, storyboards, etc.
🍕 Peer-to-peer file transfers in your browser
Demo of a customer service use case implemented with the OpenAI Agents SDK
MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
🔥 AI-powered deep research tool that breaks down complex queries, validates answers, and provides cited comprehensive results using Firecrawl and LangGraph
猫抓 浏览器资源嗅探扩展 / cat-catch Browser Resource Sniffing Extension
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
一款chrome浏览器插件,辅助一键获取当前用户小红书的cookie信息
A research prototype of a human-centered web agent
Prompts for our Grok chat assistant and the `@grok` bot on X.
Wan: Open and Advanced Large-Scale Video Generative Models
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
Real-time webcam demo with SmolVLM and llama.cpp server
ACE-Step: A Step Towards Music Generation Foundation Model
An open protocol enabling communication and interoperability between opaque agentic applications.
Examples and guides for using the OpenAI API
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)