Lists (1)
Sort Name ascending (A-Z)
Stars
🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.
Robot kinematics implemented in pytorch
Official Reporsitory of "RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation"
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Official implementation of "DepthLab: From Partial to Complete"
Benchmarking Knowledge Transfer in Lifelong Robot Learning
CUDA accelerated rasterization of gaussian splatting
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[TVCG2024] PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction
[Lumina Embodied AI Community] 具身智能技术指南 Embodied-AI-Guide
Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)
A curated list of recent diffusion models for video generation, editing, and various other applications.
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
Lumina Robotics Talent Call | Lumina社区具身智能招贤榜 | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, full-time, etc
[ACL'19] [PyTorch] Multimodal Transformer
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
pytorch implementation of Domain-Adversarial Training of Neural Networks
Gaze estimatin code. The Pytorch Implementation of "Eye Tracking for Everyone".
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
Adaptive Feature Fusion Network for Gaze Tracking in Mobile Tablets.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
VMamba: Visual State Space Models,code is based on mamba