- Dalian, Liaoning, China
Stars
PyTorch implementation of a collections of scalable Video Transformer Benchmarks.
EVSign dataset for event-based sign language recognition and translation (ECCV 2024)
[TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥
Paper list of sign language, including sign language recognition(SLR), sign language translation(SLT) and other interesting work. Quick start your awesome work with us!! 🤟🤟🤟
Continuous Sign Language Recognition with Correlation Network (CVPR 2023)
Fantastic Robustness Measures: The Secrets of Robust Generalization [NeurIPS 2023]
A curated list of papers on adversarial machine learning (adversarial examples and defense methods).
code for 'Representation Learning for Visual Object Tracking by Masked Appearance Transfer'
🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applic…
✨✨Latest Advances on Multimodal Large Language Models
Paper list for single object tracking (State-of-the-art SOT trackers)
We developed a python UI based on labelme and segment-anything for pixel-level annotation. It support multiple masks generation by SAM(box/point prompt), efficient polygon modification and category…
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
PyTorch implementation of ARKitTrack for CVPR'2023 paper "ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data", by Haojie Zhao, Junsong Chen, Lijun Wang, Huchuan Lu. Code will be…
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
[IJCV] Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.
M2DGR: a Multi-modal and Multi-scenario Dataset for Ground Robots(RA-L2021 & ICRA2022)
SwinIR: Image Restoration Using Swin Transformer (official repository)
A large-scale benchmark dataset for color-event based visual tracking
Visible-Thermal UAV Tracking: A Large-Scale Benchmark (CVPR2022)
Official implementation of CVPR 2022 paper(MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video)
Vision-Centric BEV Perception: A Survey
Official Implementation of Towards Sequence-Level Training for Visual Tracking (ECCV 2022)
[ECCV'22 Oral] Towards Grand Unification of Object Tracking