[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Python 2,114 266 Updated Aug 16, 2024

isl-org / DPT

Dense Prediction Transformers

Python 2,167 271 Updated Dec 18, 2024

Abdullah-Abuolaim / defocus-deblurring-dual-pixel

Reference github repository for the paper "Defocus Deblurring Using Dual-Pixel Data". We introduce a deep neural network (DNN) architecture that uses the dual-pixel (DP) sub-aperture views to reduc…

Python 201 23 Updated Feb 16, 2022

RF5 / simple-speech-commands

A pretrained Pytorch classifier for the Google Speech Commands dataset that is very quick to set up and use.

Python 3 Updated Dec 31, 2023

dair-ai / ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

15,377 1,466 Updated Feb 13, 2023

Luo-Jiaming / GIFT_CL

[CVPR2025] Synthetic Data is an Elegant GIFT for Continual Vision-Language Models

Python 13 1 Updated Jun 29, 2025

popjane / free_chatgpt_api

🔥 公益免费的ChatGPT API，Free ChatGPT API，GPT4 API，可直连，无需代理，使用标准 OpenAI APIKEY 格式访问 ChatGPT，可搭配ChatGPT-next-web、ChatGPT-Midjourney、Lobe-chat、Botgem、FastGPT、沉浸式翻译等项目使用

4,733 477 Updated Nov 12, 2024

amusi / daily-paper-computer-vision

记录每天整理的计算机视觉/深度学习/机器学习相关方向的论文

6,574 1,281 Updated Jul 8, 2023

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 29,660 3,655 Updated Jul 23, 2024

jianzongwu / download-cc3m

Jupyter Notebook 6 Updated Jan 13, 2023

QuoQA-NLP / Ko-conceptual-captions

Google's Conceptual Captions Dataset translated into Korean

22 2 Updated Aug 28, 2022

google-research-datasets / conceptual-captions

Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.

Shell 541 28 Updated Aug 21, 2021

McGill-NLP / llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,549 125 Updated Jan 24, 2025

mli / paper-reading

深度学习经典、新论文逐段精读

30,672 2,666 Updated Mar 22, 2025

Whiffe / Custom-ava-dataset_Custom-Spatio-Temporally-Action-Video-Dataset

Custom ava dataset, Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions

Python 124 21 Updated Jun 7, 2022

noureldien / vivit_pytorch

Implementation of ViViT: A Video Vision Transformer - Zipping Coding Challenge

Python 32 Updated Jun 10, 2021

rishikksh20 / ViViT-pytorch

Implementation of ViViT: A Video Vision Transformer

Python 537 68 Updated Jun 21, 2021

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,672 1,297 Updated Aug 14, 2024

yzfly / yzfly

8 Updated Nov 18, 2024

bubbliiiing / clip-pytorch

这是一个clip-pytorch的模型，可以训练自己的数据集。

Python 232 30 Updated Apr 5, 2023

huggingface / diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Jupyter Notebook 4,069 454 Updated Feb 12, 2025

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 23,285 3,331 Updated Mar 5, 2025

facebookresearch / detr

End-to-End Object Detection with Transformers

Python 14,488 2,576 Updated Mar 12, 2024

Megvii-BaseDetection / YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python 9,918 2,347 Updated Jun 8, 2025

ultralytics / yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 54,457 17,014 Updated Jun 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pepper725

Block or report pepper725

Stars

deep-spin / Infinite-Video

XiaoCode-er / 3D-Skeleton-Display

sindresorhus / Plash

dvlab-research / VisionZip

csZcWu / NRKNet

swz30 / Restormer