Starred repositories
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
AutoVideo: An Automated Video Action Recognition System
Distributed vector search for AI-native applications
1st-place solution for the ICDAR 2021 Competition on Mathematical Formula Detection (champion solution for formula detection)
Deep relational reasoning graph network for arbitrary shape text detection; Accepted by CVPR 2020 (Oral). http://arxiv.org/abs/2003.07493
[ICML 2021 Oral] We show pure attention suffers rank collapse, and how different mechanisms combat it.
LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference
This is an official implementation for "Contextual Transformer Networks for Visual Recognition".
Development repository for the Triton language and compiler
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for further understanding the papers.⭐⭐⭐
📉 Hand-drawn style charts library for Python
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Image Visualization Tools (object detection, semantic and instance segmentation)
Code for "Deep Snake for Real-Time Instance Segmentation" CVPR 2020 oral
[CVPR 2020] CenterMask: Real-time Anchor-Free Instance Segmentation
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
ISTR: End-to-End Instance Segmentation with Transformers (https://arxiv.org/abs/2105.00637)
Unofficial implementation of MaX-DeepLab for Instance Segmentation
SOLO and SOLOv2 for instance segmentation, ECCV 2020 & NeurIPS 2020.
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach
General Multi-label Image Classification with Transformers
📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
Scene Text Recognition (STR) methods trained with fewer real labels (CVPR 2021)
Self-attention based Text Knowledge Mining for Text Detection
Efficient research work environment setup for computer science and general workflow for Deep Learning experiments
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
This repository contains the source code for the paper First Order Motion Model for Image Animation
Code for my Medium article: ["Faster Notes with Python and Deep Learning"](https://medium.com/p/b713bbb3c186/edit)