Stars
Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024
Code release for ICCV 2021 paper "Anticipative Video Transformer"
Source code for MICCAI 2023 paper entitled: 'FeSViBS: Federated Split Learning of Vision Transformer with Block Sampling'
[NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of training.
A tool for efficient semi-supervised video object segmentation (great results with minimal manual labor) and a dataset for benchmarking
find the spot on solar panel's thermal image
[CVPR 2023] CaPriDe Learning: Confidential and Private Decentralized Learning based on Encryption-friendly Distillation Loss
[ICCV'23] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Source codes of our paper in CVPR 2019: Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
[AVSS21 Oral] A framework consisting of Dissimilarity Attention Module (DAM) to discriminate the anomaly instances from normal ones both at feature level and score level. In order to decide instanc…
[BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".
Source code for MICCAI 2022 paper entitled: 'Self-Ensembling Vision Transformer (SEViT) for Robust Medical Image Classification'
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
[ECCV'22] Official repository of paper titled "Class-agnostic Object Detection with Multi-modal Transformer".
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
EscVM YouTube Channel Repository. Start from Notebooks ⬅️
Paper implementations from scratch and machine learning tutorials
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A PyTorch toolkit for 2D Human Pose Estimation.