Starred repositories
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …
An open-source JavaScript library for world-class 3D globes and maps 🌎
Open 3D Engine (O3DE) is an Apache 2.0-licensed multi-platform 3D engine that enables developers and content creators to build AAA games, cinema-quality 3D worlds, and high-fidelity simulations wit…
Godot Engine – Multi-platform 2D and 3D game engine
Extendable version manager with support for Ruby, Node.js, Elixir, Erlang & more
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
EfficientDet (Scalable and Efficient Object Detection) implementation in Keras and TensorFlow
A Python toolkit of the BOP benchmark for 6D object pose estimation.
[CVPR19] FSA-Net: Learning Fine-Grained Structure Aggregation for Head Pose Estimation from a Single Image
Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation (CVPR 2021)
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
The premier source of truth powering network automation. Open source under Apache 2. Try NetBox Cloud free: https://netboxlabs.com/products/free-netbox-cloud/
AI Image Signal Processing and Computational Photography. Official library for NTIRE (CVPR) and AIM (ICCV/ECCV) Challenges. You will find Learned ISPs, RAW Restoration-Upsampling-Reconstruction, Im…
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
🚀 Fast, secure, lightweight containers based on WebAssembly
RTMPose series (RTMPose, DWPose, RTMO, RTMW) without mmcv, mmpose, mmdet etc.
OpenMMLab Pose Estimation Toolbox and Benchmark.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
LAVIS - A One-stop Library for Language-Vision Intelligence
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image