Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
[CVPR 2025] Spotting the Unexpected (STU): A 3D LiDAR Dataset for Anomaly Segmentation in Autonomous Driving
[Official Implementation] LiDAR-GS:Real-time LiDAR Re-Simulation using Gaussian Splatting
[ICCV 2023] CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training
[AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios
Mix3D: Out-of-Context Data Augmentation for 3D Scenes (3DV 2021 Oral)
This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models"
A minimal LLM chat app that runs entirely in your browser
Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.
An easy way to apply LoRA to CLIP. Implementation of the paper "Low-Rank Few-Shot Adaptation of Vision-Language Models" (CLIP-LoRA) [CVPRW 2024].
This is official code about "Out-of-Distribution Detection with Prototypical Outlier Proxy" in AAAI 2025
YOLO-UniOW: Efficient Universal Open-World Object Detection
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
Building One-class Detector for Anything: Open-vocabulary Zero-shot OOD Detection UsingText-image Models
Training a network on the mnist_dataset in tensorflow and then deploying it in C++.
[ECCV2024] The Official Implementation for ''AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection''
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
[ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
Data and code for ECCV2024 paper "CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection".
CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)
Official PyTorch implementation of the paper ‘VLM2Scene: Self-Supervised Image-Text-LiDAR Learning with Foundation Models for Autonomous Driving Scene Understanding’ (AAAI'2024)
Official implementation of the paper “MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes”
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)