Stars
Active Multi-class Mapping using Semantic Shannon Mutual Information (SSMI)
Riemannian Optimization for Active Mapping with Robot Teams (ROAM)
[CVPR 2025 Oral] VGGT: Visual Geometry Grounded Transformer
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
[ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory
projectaria_tools is a C++/Python open-source toolkit to interact with Project Aria data
rogersce / cnpy
library to read/write .npy and .npz files in C/C++
Hackable and optimized Transformers building blocks, supporting a composable construction.
Official code release for DynoSAM: Dynamic Object Smoothing And Mapping [Submitted TRO Visual SLAM SI]. A visual SLAM framework and pipeline for dynamic environments, estimating for the motion/pose…
An open source implementation of CLIP.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Terminal stock ticker with live updates and position tracking
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
No fortress, purely open ground. OpenManus is Coming.
[CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation
A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks
Code for RSS-2019 paper: Planning with State Abstractions for Non-Markovian Task Specifications
[CVPR 2025] Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
Code to reproduce experiments in the paper "Constrained Language Models Yield Few-Shot Semantic Parsers" (EMNLP 2021).
[ICML 2025 Spotlight] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
C++ header-only library with methods to efficiently encode/decode Morton codes in/from 2D/3D coordinates