Highlights
- Pro
Stars
ROS 2 Navigation Framework and System
A 3DGS framework for omni urban scene reconstruction and simulation.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
NVR with realtime local object detection for IP cameras
[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
ROS 2 packages based on NVIDIA libArgus library for NVIDIA-accelerated CSI camera support.
🤩 Easy-to-use global IM bot platform designed for the LLM era / 简单易用的大模型即时通信机器人开发平台 ⚡️ Bots for QQ / QQ频道 / Discord / WeChat(企业微信、个人微信)/ Telegram / 飞书 / 钉钉 / Slack 🧩 Integrated with ChatGPT、DeepSee…
[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
OpenStereo: A Comprehensive Benchmark for Stereo Matching
Open Source SMT Pick and Place Hardware and Software
A lightweight neural-network for rapid detection of traffic cones
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
🤯 Lobe Chat - an open-source, modern design AI chat framework. Supports multiple AI providers (OpenAI / Claude 4 / Gemini / DeepSeek / Ollama / Qwen), Knowledge Base (file upload / knowledge manage…
Master programming by recreating your favorite technologies from scratch.
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version in translation
Official inference repo for FLUX.1 models
C/C++ WebRTC network library featuring Data Channels, Media Transport, and WebSockets
A curated list of awesome Deep Stereo Matching resources
Official implementation of Monocular Quasi-Dense 3D Object Tracking, TPAMI 2022
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
the AI-native open-source embedding database
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
📚 Collection of awesome generation acceleration resources.
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Ultimate camera streaming application with support RTSP, RTMP, HTTP-FLV, WebRTC, MSE, HLS, MP4, MJPEG, HomeKit, FFmpeg, etc.
WebRTC/RTSP/RTMP/LL-HLS bridge for Wyze cams in a docker container