Stars
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. π₯ π₯ π₯
GraXpert is an astronomical image processing program for extracting and removing gradients from the background of your astrophotos.
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, 8000 and π video, up to 5x faster than OpenAI CLIP and LLaVA πΌοΈ & ποΈ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
AI Agent that handles engineering tasks end-to-end: integrates with developersβ tools, plans, executes, and iterates until it achieves a successful result.
A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Semi-internal command line tool to run GPU tasks
Official code of The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization
A small application that shows weather forecast for St. Petersburg
π°οΈ List of satellite image training datasets with annotations for computer vision and deep learning
This repository contains the dataset including the pair of 2D face image and its corresponding 3D face geometry model.
Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019
High-Resolution 3D Human Digitization from A Single Image.
State-of-the-art 2D and 3D Face Analysis Project
ECCV2020 paper "Whole-Body Human Pose Estimation in the Wild"
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
End-to-End Object Detection with Transformers
π Play and Record Sound with Python π
3D sound propagation simulator using adaptive rectangular decomposition method.
ποΈ Open Source Audio Matching and Mastering