Starred repositories
A faster pytorch implementation of faster r-cnn
Datawhale成员整理的面经,内容包括机器学习,CV,NLP,推荐,开发等,欢迎大家star
This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently us…
Fully open reproduction of DeepSeek-R1
This is an official implementation of our CVPR 2023 paper "Human Pose as Compositional Tokens" (https://arxiv.org/pdf/2303.11638.pdf)
CMU-Perceptual-Computing-Lab / convolutional-pose-machines-release
Forked from shihenw/convolutional-pose-machines-releaseCode repository for Convolutional Pose Machines
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
An official implementation of the Anchor DETR.
[CVPR 2020] Detection in Crowded Scenes: One Proposal, Multiple Predictions
A OpenMMLAB toolbox for human pose estimation, skeleton-based action recognition, and action synthesis.
Efficient violence detection in surveillance videos using Human Skeletons and Motion Estimation
Unofficial PyTorch implementation of the CVPR'19 paper "Skeleton-Based Action Recognition with Directed Graph Neural Networks".
Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition in CVPR19
Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
A toolbox for skeleton-based action recognition.
[CVPR 2025 Highlight] PyTorch implementation of "Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition"
Pytorch implementation of pose proposal networks
[CVPR 2020 Oral] PyTorch implementation of "Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition"
A curated paper list of awesome skeleton-based action recognition.
yysijie / st-gcn
Forked from open-mmlab/mmskeletonSpatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch
This repository contains the code and resources for the "CUE-Net: Violence Detection Video Analytics with Spatial Cropping, Enhanced UniformerV2 and Modified Efficient Additive Attention" paper acc…
A large scale video database for violence detection, which has 2,000 video clips containing violent or non-violent behaviours.
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
PyTorch implementation of over 30 realtime semantic segmentations models, e.g. BiSeNetv1, BiSeNetv2, CGNet, ContextNet, DABNet, DDRNet, EDANet, ENet, ERFNet, ESPNet, ESPNetv2, FastSCNN, ICNet, LEDN…
This is the official repository for our recent work: PIDNet