Stars
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
This is the official implementation of the paper "SNR-Aware Low-Light Image Enhancement" (CVPR 2022)
Code for coupled TGV regularization of multi-spectral/multi-modal inverse problems
Collection of recent shadow removal works, including papers, codes, datasets, and metrics.
SteveImmanuel / SegGPT-FineTune
Forked from baaivision/Painter
Fine-tune SegGPT model with custom datasets
Medical Image Segmentation with Diffusion Model
[ICLR 2022] code for "How Much Can CLIP Benefit Vision-and-Language Tasks?" https://arxiv.org/abs/2107.06383
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
sketch + style = paints 🎨 (TOG2018/SIGGRAPH2018ASIA)
Official code for ECCV 2022 paper "CT2: Colorization Transformer via Color Tokens"
[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
[ECCV 2022] Official PyTorch implementation of "BigColor"
Code for the CVPR 2020 paper "A Multi-task Mean Teacher for Semi-supervised Shadow Detection"
Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Scope+: An open source generalizable architecture for single-cell atlases at sample and cell levels
Fast and memory-efficient exact attention
VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)
A Deep Learning based project for colorizing and restoring old images (and video!)
Official implementation of the CVPR 2022 paper "DETReg: Unsupervised Pretraining with Region Priors for Object Detection".
[CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》
[CVPR 2023] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation