-
Carnegie Mellon University
-
15:11
(UTC -12:00)
Stars
Official implementation of "Segment Any Anomaly without Training via Hybrid Prompt Regularization (SAA+)".
Offical implementation of "RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection (CVPR 2024)"
[ECCV2024] The Official Implementation for ''AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection''
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
基于Clash Core 制作的Clash For Linux备份仓库 A Clash For Linux Backup Warehouse Based on Clash Core
Robust Speech Recognition via Large-Scale Weak Supervision
An optimal trajectory planner considering distinctive topologies for mobile robots based on Timed-Elastic-Bands (ROS Package)
Unitree robot sdk version 2. https://support.unitree.com/home/zh/developer
Python interface for unitree sdk2
A generative world for general-purpose robotics & embodied AI learning.
Training code of waypoint predictor in Discrete-to-Continuous VLN.
[ICLR 2024] Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach Link: https://arxiv.org/abs/2401.15652
[ICML 2025] Official PyTorch implementation of LongVU
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
A tutorial for getting started with the Deep Learning Accelerator (DLA) on NVIDIA Jetson
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).
Official implementation of CVPR24 highlight paper "Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance"
Mobile manipulation research tools for roboticists
[ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
✨✨Latest Advances on Multimodal Large Language Models