8000 GitHub - arthur-75/information-retriever-resources: All what you need for information retrieval resources
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

arthur-75/information-retriever-resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

information-retriever-pipeline

I am doing a PhD on Information retriever and I am happy to share what I learning All what you need for information retrieval resources. (Inspired by caiyinqiong)

Pipeline IR - How to build an IR pipeline



Awesome



Awesome

Feel free to participate. Work in progress...

Contents


Survey Paper

Paper Author Venue Code
Semantic Matching in Search Li et al., 2014 NA NA
Pretrained Transformers for Text Ranking: BERT and Beyond Lin et al., 2021 arXiv NA
Semantic Models for the First-stage Retrieval: A Comprehensive Review Guo et al., 2021 TOIS new.gif
A Proposed Conceptual Framework for a Representational Approach to Information Retrieval Lin et al., 2021 arXiv NA
Pre-training Methods in Information Retrieval Fan et al., 2022 arXiv new.gif
Dense Text Retrieval based on Pretrained Language Models: A Survey Zhao et al., 2022 arXiv new.gif
Low-Resource Dense Retrieval for Open-Domain Question Answering: A Comprehensive Survey Shen et al., 2022 arXiv new.gif
Lecture Notes on Neural Information Retrieval Tonellotto et al., 2022 arXiv new.gif

Query Expansion

Global Model

Paper Author Year Model
Word-word Associations in Document Retrieval Systems NA 1969 NA
Concept based Query Expansion NA 1993 NA
Query Expansion using Lexical-Semantic Relations NA 1994 NA
Using Query Contexts in Information Retrieval NA 2007 NA

Local Model

Paper Author Year Model
Relevance Feedback in Information Retrieval NA 1971 Rocchio PRF
Model-based Feedback in the Language Modeling Approach to Information Retrieval NA 2001 Divergence Minimization Model
UMass at TREC 2004: Novelty and HARD NA 2004 RM3 for PRF
Selecting Good Expansion Terms for Pseudo-Relevance Feedback NA 2008 PRF
A Comparative Study of Methods for Estimating Query Language Models with Pseudo Feedback NA 2009 NA
Pseudo-Relevance Feedback Based on Matrix Factorization NA 2016 NA
Reducing the Risk of Query Expansion via Robust Constrained Optimization NA 2009 query drift problem
Query Expansion using Local and Global Document Analysis NA 2017 NA

Classical Term-based Retrieval

Paper Author Year Model
A Vector Space Model for Automatic Indexing NA 1975 VSM
Developments in Automatic Text Retrieval NA 1991 TFIDF
Term-weighting Approaches in Automatic Text Retrieval NA 1988 TFIDF
Relevance Weighting of Search Terms NA 1976 BIM
A Theoretical Basis for the Use of Co-occurrence Data in Information Retrieval NA 1997 Tree Dependence Model
The Probabilistic Relevance Framework: BM25 and Beyond NA 2010 BM25
A Language Modeling Approach to Information Retrieval NA 1998 QL
Statistical Language Models for Information Retrieval NA 2007 LM for IR
Hypergeometric Language Model and Zipf-Like Scoring Function for Web Document Similarity Retrieval NA 2010 LM for IR
Probabilistic Models of Information Retrieval Based on Measuring the Divergence from Randomness NA 2002 DFR

Early Methods for Semantic Retrieval

Document Expansion

Paper Year Model
Corpus Structure, Language Models, and Ad Hoc Information Retrieval 2004 NA
Cluster-Based Retrieval Using Language Models 2004 NA
Language Model Information Retrieval with Document Expansion 2006 NA
Document Expansion Based on WordNet for Robust IR 2010 NA
Improving Retrieval of Short Texts Through Document Expansion 2012 NA
Document Expansion Using External Collections 2017 WordNet-based
Document Expansion versus Query Expansion for Ad-hoc Retrieval 2005 NA

Term Dependency Model

Paper Year Model
Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods 1987 VSM + term dependency
Term-weighting Approaches in Automatic Text Retrieval 1988 VSM + term dependency
An Analysis of Statistical and Syntactic Phrases 1997 VSM + term dependency
A Probabilistic Model of Information Retrieval: Development and Comparative Experiments 2000 VSM + term dependency
Relevance Ranking Using Kernels 2010 BM25 + term dependency
A General Language Model for Information Retrieval 1999 LM + term dependency
Biterm Language Models for Document Retrieval 2002 LM + term dependency
Capturing Term Dependencies using a Language Model based on Sentence Trees 2002 LM + term dependency
Dependence Language Model for Information Retrieval 2004 LM + term dependency
A Generative Theory of Relevance 2008 NA
A Markov Random Field Model for Term Dependencies 2005 SDM

Early Methods for Semantic Retrieval

Document Expansion

Paper Year Author Code
Corpus Structure, Language Models, and Ad Hoc Information Retrieval 2004 Susan T. Dumais NA
Cluster-Based Retrieval Using Language Models 2004 Susan T. Dumais NA
Language Model Information Retrieval with Document Expansion 2006 Jimmy Lin NA
Document Expansion Based on WordNet for Robust IR 2010 Jey Han Lau NA
Improving Retrieval of Short Texts Through Document Expansion 2012 Hang Li NA
Document Expansion Using External Collections 2017 Hui Fang WordNet-based
Document Expansion versus Query Expansion for Ad-hoc Retrieval 2005 Douglas W. Oard NA

Term Dependency Model

Paper Author Year Code
Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods 1987 Donna Harman VSM + term dependency
Term-weighting Approaches in Automatic Text Retrieval 1988 Stephen E. Robertson VSM + term dependency
An Analysis of Statistical and Syntactic Phrases 1997 Amit Singhal VSM + term dependency
A Probabilistic Model of Information Retrieval: Development and Comparative Experiments 2000 Stephen E. Robertson VSM + term dependency
Relevance Ranking Using Kernels 2010 ChengXiang Zhai BM25 + term dependency
A General Language Model for Information Retrieval 1999 Donald Metzler LM + term dependency
Biterm Language Models for Document Retrieval 2002 Qiaozhu Mei LM + term dependency
Capturing Term Dependencies using a Language Model based on Sentence Trees 2002 Qiaozhu Mei LM + term dependency
Dependence Language Model for Information Retrieval 2004 Zhiheng Huang LM + term dependency
A Generative Theory of Relevance 2008 Susan T. Dumais NA
A Markov Random Field Model for Term Dependencies 2005 Tao Tao SDM

Translation Model

Paper Author Year Code
Information Retrieval as Statistical Translation Bonnie J. Dorr 1999 NA
Estimation of Statistical Translation Models Based on Mutual Information for Ad Hoc Information Retrieval Benjamin Van Durme 2010 NA
Clickthrough-Based Translation Models for Web Search: From Word Models to Phrase Models Hang Li 2010 NA
Axiomatic Analysis of Translation Language Model for Information Retrieval ChengXiang Zhai 2012 NA
Query Rewriting Using Monolingual Statistical Machine Translation Maosong Sun 2010 for query expansion
Towards Concept-based Translation Models using Search Logs for Query Expansion Zhen Tan 2012 for query expansion

Neural Methods for Semantic Retrieval

Sparse Retrieval Methods

Dense Retrieval Methods

Hybrid Retrieval Methods

Other Resources

Other Tasks

Datasets

Indexing Methods

About

All what you need for information retrieval resources

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0