-
Peking University
- Beijing
Stars
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
[ECCV 2024] A Simple and Effective 3D DETR in Point Clouds
程序员延寿指南 | A programmer's guide to live longer
[NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model
[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version in translation
[CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
[ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
[IEEE T-PAMI 2023] Awesome BEV perception research and cookbook for all level audience in autonomous diriving
Vision-Centric BEV Perception: A Survey
[Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.
OpenMMLab's next-generation platform for general 3D object detection.
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.