Stars
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
Joint Detection and Embedding for fast multi-object tracking
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
Code release for NeRF (Neural Radiance Fields)
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[ECCV 2022] Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
[ECCV2022] Contrastive Vision-Language Pre-training with Limited Resources
Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥
The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition
Tool-box for efficient build and debug in Python. Especially for Scientific Computing and Computer Vision.
OpenMMLab Detection Toolbox and Benchmark
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
This is a re-implementation of Siamese-RPN with pytorch, which is CVPR2018 spotlight.
PyTorch implementation of Multi-Label Image Recognition with Graph Convolutional Networks, CVPR 2019.
Lifelong/Continual Learning Paper List
Image augmentation for machine learning experiments.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.