Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
Fully local, no dependency scribe. Speak into your microphone and summarize. Requires iOS 26 and MacOS 26 to use the advanced transcription model and foundational model for summaries
publish simple messages to be recorded in each rosbag for easier identification
Control the Google Presentation slides with Hand Gesture
Implementation of CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation.
Web3 healthcare app that puts patients in control of their medical data with secure blockchain-based sharing and data management
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
[CVPR2025] We present SleeperMark, a novel framework designed to embed resilient watermarks into T2I diffusion models
Official implementation of LangCoop: Collaborative Driving with Natural Language
Discriminative Constrained Optimization for Reinforcing Large Reasoning Models
a comprehensive and critical synthesis of the emerging role of GenAI across the full autonomous driving stack
OpenMMLab Foundational Library for Training Deep Learning Models
Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
A high-throughput and memory-efficient inference and serving engine for LLMs
Fully open reproduction of DeepSeek-R1
mm-grounding-dino-for-training
Mcity2.0 demo
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, logical, arithmetic, and common-sense reasoning tasks.