Stars
[ICLR 2025] Point-SAM: Promptable 3D Segmentation Model for Point Clouds
Pointcept: a codebase for point cloud perception research. Latest works: Sonata (CVPR'25 Highlight), PTv3 (CVPR'24 Oral), PPT (CVPR'24), MSC (CVPR'23)
Papers, code and datasets about deep learning for 3D Semantic Segmentation.
Awesome Data-Driven Autonomous Driving Solutions. Also the official repository of our survey paper: Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Min…
Code for RA-L journal and IROS 2022 paper "DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association".
[ICCV 2023] DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds
Code for Paper, MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries. https://tsinghua-mars-lab.github.io/mutr3d/
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs
A continuously updated project to track the latest progress in the field of multi-modal object tracking. This project focuses solely on single-object tracking.
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Making large AI models cheaper, faster and more accessible
3D Point Cloud Annotation Platform for Autonomous Driving
3D Bounding Box Annotation Tool (3D-BAT) Point cloud and Image Labeling
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
心理健康大模型 (LLM x Mental Health), Pre & Post-training & Dataset & Evaluation & Depoly & RAG, with InternLM / Qwen / Baichuan / DeepSeek / Mixtral / LLama / GLM series models
最全数据分析资料汇总(含python、爬虫、数据库、大数据、tableau、统计学等)
methods of online vector map for autonomous driving.
A curated list of awesome HD map construction methods
[T-PAMI-2024] Transformer-Based Visual Segmentation: A Survey
Unofficial Reimplementation of VLTSeg from "Strong but Simple: A Baseline for Domain Generalized Dense Perception by CLIP-based Transfer Learning"
OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.