Incomplete Record of Paper Reading

记录当天阅读 1 小时以上的文献 (不一定读完), 勾选表示做了笔记.

2025

20250213 [2020] End-to-end object detection with transformer
20250212 [2017] Attention is All You Need
20250121 [2022] SVTR_ Scene Text Recognition with a Single Visual Model

2024

20241111 [2022 IJCAI] SVTR_ Scene Text Recognition with a Single Visual Model
20241111 [2020 AAAI] Real-time Scene Text Detection with Differentiable Binarization
20240408 [2023] Improved Baselines with Visual Instruction Tuning
20240227 泛读大模型压缩相关文献
20240227 [2022 ICLR] Finetuned Language Models Are Zero-Shot Learners
20240223 [2015] Cross Modal Distillation for Supervision Transfer
20240222 泛读大模型压缩相关文献
20240222 [2022] Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation
20240222 [2023] Multimodal Chain-of-Thought Reasoning in Language Models
20240221 泛读大模型压缩相关文献
20240221 [2018] Improving language understanding by generative pre-training
20240220 [2023 ACL] Distilling Step-by-Step_ Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
20240220 [2018] Distilling Task-Specific Knowledge from BERT into Simple Neural Networks
20240220 [2020] DistilBERT, a distilled version of BERT_ smaller, faster, cheaper and lighter
20240204 [2023 CVPR] Micron-BERT_ BERT-based Facial Micro-Expression Recognition

2023

20231120 [2023] Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
20231116 [2023] Shikra_ Unleashing Multimodal LLM’s Referential Dialogue Magic
20231116 [2023] Ferret_ Refer and Ground Anything Anywhere at Any Granularity
20231107 [2023] Visual Instruction Tuning
20231107 [2023] What Makes for Good Visual Instructions_ Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning
20231019 [2021] Improving Calibration for Long-Tailed Recognition
20231017 [2023 NeurIPS] Multi-modal Queried Object Detection in the Wild
20231017 [2019] Objects365_ A Large-scale, High-quality Dataset for Object Detection
20230829 [2021] RegionCLIP_ Region-based Language-Image Pretraining
20230829 [2021] OpenPrompt_ An Open-source Framework for Prompt-learning
20230630 泛读多模态任务微调相关文献
20230627 [2023] Segment Anything
20230627 [2020] End-to-End Object Detection with Transformers
20220621 [2022 ECCV] Visual Prompt Tuning
20220621 [2023] Segment Anything in High Quality
20220510 泛读视觉语言预训练相关文献
20230427 [2021] Swin Transformer_ Hierarchical Vision Transformer using Shifted Windows
20230427 [2022] Expanding Language-Image Pretrained Models for General Video Recognition
20230422 [2020] An image is worth 16x16 words_ Transformers for image recognition at scale
20230422 [2021] Align before Fuse_ Vision and Language Representation Learning with Momentum Distillation
20230418 [2021] BEiT_ BERT Pre-Training of Image Transformers
20230418 [2021] iBOT_ Image BERT Pre-Training with Online Tokenizer
20230418 [2023] DINOv2_ Learning Robust Visual Features without Supervision
20230417 [2021] Emerging Properties in Self-Supervised Vision Transformers
20230416 [2023] Scaling Vision Transformers to 22 Billion Parameters
20230416 [2022] LiT_ Zero-Shot Transfer with Locked-image text Tuning
20230416 [2021 ICML] Scaling up visual and vision-language representation learning with noisy text supervision
20230404 [2022] Confident Learning_ Estimating Uncertainty in Dataset Labels
20230206 [2021 NIPS] SegFormer_ Simple and Efficient Design for Semantic Segmentation with Transformers

2022

2021

2020

2014-2017

20170813 [2013 CVPR] Saliency Detection via Graph-Based Manifold Ranking
20170214 [2016] Understanding and Improving Convolutional Neural Networks via CReLU
20170122 [2016] Deep Learning without Poor Local Minima
20170120 [2016] Large-Margin Softmax Loss for Convolutional Neural Networks
20170000 [2016 ICLR] Deep Compression_ Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
20161230 [2011] Fast coordinate descent methods with variable
20150000 [2012 NPAR] Combining Sketch and Tone for Pencil Drawing Production
20140000 [2010 CVPR] Detecting Text in Natural Scenes with Stroke Width Transform

Name		Name	Last commit message	Last commit date
Latest commit History 144 Commits
_notes		_notes
audio_and_speech		audio_and_speech
machine_learning		machine_learning
mathematics		mathematics
natural_language		natural_language
vision_language		vision_language
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Incomplete Record of Paper Reading

2025

2024

2023

2022

2021

2020

2014-2017

About

Uh oh!

Releases

Packages

Uh oh!

quarrying/quarrying-paper-notes

Folders and files

Latest commit

History

Repository files navigation

Incomplete Record of Paper Reading

2025

2024

2023

2022

2021

2020

2014-2017

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages