SOTAVerified

Information Retrieval

Information retrieval is the task of ranking a list of documents or search results in response to a query

( Image credit: sudhanshumittal )

Papers

Showing 101150 of 4740 papers

TitleStatusHype
Melody transcription via generative pre-trainingCode2
RetroMAE v2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language ModelsCode2
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of LanguagesCode2
Multilingual Search with Subword TF-IDFCode2
Atlas: Few-shot Learning with Retrieval Augmented Language ModelsCode2
Infinite Recommendation Networks: A Data-Centric ApproachCode2
RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-EncoderCode2
Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph CompletionCode2
Autoregressive Search Engines: Generating Substrings as Document IdentifiersCode2
Multi-CPR: A Multi Domain Chinese Dataset for Passage RetrievalCode2
SGPT: GPT Sentence Embeddings for Semantic SearchCode2
InPars: Data Augmentation for Information Retrieval using Large Language ModelsCode2
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late InteractionCode2
Omnizart: A General Toolbox for Automatic Music TranscriptionCode2
FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity SearchCode2
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval ModelsCode2
Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense RepresentationsCode2
Pretrained Transformers for Text Ranking: BERT and BeyondCode2
GiantMIDI-Piano: A large-scale MIDI dataset for classical piano musicCode2
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERTCode2
OpenNRE: An Open and Extensible Toolkit for Neural Relation ExtractionCode2
Multi-Interest Network with Dynamic Routing for Recommendation at TmallCode2
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational AgentsCode2
Knowledge Representation Learning: A Quantitative ReviewCode2
CheMatAgent: Enhancing LLMs for Chemistry and Materials Science through Tree-Search Based Tool LearningCode1
REARANK: Reasoning Re-ranking Agent via Reinforcement LearningCode1
Modality Curation: Building Universal Embeddings for Advanced Multimodal Information RetrievalCode1
POQD: Performance-Oriented Query Decomposer for Multi-vector retrievalCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge GraphsCode1
ReCDAP: Relation-Based Conditional Diffusion with Attention Pooling for Few-Shot Knowledge Graph CompletionCode1
Exploring _0 Sparsification for Inference-free Sparse RetrieversCode1
Template-Based Financial Report Generation in Agentic and Decomposed Information RetrievalCode1
Building Russian Benchmark for Evaluation of Information Retrieval ModelsCode1
Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End SystemCode1
Lightweight and Direct Document Relevance Optimization for Generative Information RetrievalCode1
Distillation and Refinement of Reasoning in Small Language Models for Document Re-rankingCode1
LIRA: A Learning-based Query-aware Partition Framework for Large-scale ANN SearchCode1
Narrative Trails: A Method for Coherent Storyline Extraction via Maximum Capacity Path OptimizationCode1
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix FactorizationCode1
LLM-QE: Improving Query Expansion by Aligning Large Language Models with Ranking PreferencesCode1
Judging the Judges: A Collection of LLM-Generated Relevance JudgementsCode1
Towards Text-Image Interleaved RetrievalCode1
FairDiverse: A Comprehensive Toolkit for Fair and Diverse Information Retrieval AlgorithmsCode1
Syntriever: How to Train Your Retriever with Synthetic Data from LLMsCode1
Scalable-Softmax Is Superior for AttentionCode1
TFLOP: Table Structure Recognition Framework with Layout Pointer MechanismCode1
MechIR: A Mechanistic Interpretability Framework for Information RetrievalCode1
kANNolo: Sweet and Smooth Approximate k-Nearest Neighbors SearchCode1
Length-Aware DETR for Robust Moment RetrievalCode1
Show:102550
← PrevPage 3 of 95Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Two-tower Bi-Encoder (RoBERTa)Recall@10074.78Unverified
2Siamese Bi-Encoder (RoBERTa)Recall@10071.63Unverified
3BM25Recall@10051.33Unverified
#ModelMetricClaimedVerifiedStatus
1RetroMAE v2MRR@1042.58Unverified
2ConAE-256Time (ms)0.33Unverified
3ConAE-128Time (ms)0.32Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BmAP@1000.16Unverified
2TSDAEmAP@1000.15Unverified
#ModelMetricClaimedVerifiedStatus
1hpipubcommoninfNDCG0.56Unverified
2hpictallinfNDCG0.55Unverified
#ModelMetricClaimedVerifiedStatus
1MINDHR@300.32Unverified
#ModelMetricClaimedVerifiedStatus
1Distilled NetworknDCG@100.53Unverified
#ModelMetricClaimedVerifiedStatus
1RetroMAEMRR@100.42Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-5.8B-msmarconDCG@1050.25Unverified
#ModelMetricClaimedVerifiedStatus
1Information Retrieval + SVM1:1 Accuracy83.79Unverified
#ModelMetricClaimedVerifiedStatus
1BERT+CONCEPT FILTERNDCG0.25Unverified