Stars
pySLAM is a visual SLAM pipeline in Python for monocular, stereo and RGBD cameras. It supports many modern local and global features, different loop-closing methods, a volumetric reconstruction pip…
[MICCAI'2024] EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera
Universal Monocular Metric Depth Estimation
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
[CVPR24] Depth Prompting for Sensor-Agnostic Depth Estimation
A collection of World Models papers for autonomous driving (and robotics).
Graph-based Knowledge Tracing: Modeling Student Proficiency Using Graph Neural Network
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
A geometry-aware deep network for depth estimation in monocular endoscopy
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
[ICCV2023 Oral & TPAMI2024] NDDepth: Normal-Distance Assisted Monocular Depth Estimation and Completion
[ECCV 2022 oral] Monocular 3D Object Detection with Depth from Motion
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.
MATLAB implementation of the paper "Robust Uncertainty-Aware Multiview Triangulation"
Source code for DCL-Net, a deep learning model for sensorless freehand 3D ultrasound volume reconstruction.
[CVPR 2023] Multi-frame depth estimation in dynamic scenes. -- Li, Rui, et al. "Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic Scenes".
AI-assisted identification of location and anatomical structure in biliary pancreatic endoscopic ultrasound.
An image registration/augmentation/segmentation package