SOTAVerified

Information Retrieval

Information retrieval is the task of ranking a list of documents or search results in response to a query.
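As a rough illustration of the ranking task described above, here is a minimal sketch that scores documents against a query with TF-IDF weights and cosine similarity. It is not the method of any paper listed on this page. The function names (`tokenize`, `rank`), the whitespace tokenizer, and the smoothed IDF formula are assumptions chosen for brevity.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization, no stemming or stopwords.
    return text.lower().split()


def rank(query, docs):
    """Return (doc_index, score) pairs sorted by descending cosine similarity."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)

    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    # Smoothed inverse document frequency (one common variant among several).
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

    def vectorize(tokens):
        # Sparse TF-IDF vector; terms unseen in the collection are dropped.
        tf = Counter(tokens)
        return {t: tf[t] * idf[t] for t in tf if t in idf}

    def cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    q_vec = vectorize(tokenize(query))
    scores = [(i, cosine(q_vec, vectorize(toks))) for i, toks in enumerate(tokenized)]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    docs = [
        "the cat sat on the mat",
        "dogs chase cats in the park",
        "stock markets fell sharply today",
    ]
    for idx, score in rank("cat on a mat", docs):
        print(idx, round(score, 3), docs[idx])
```

Purely lexical scoring like this misses "cats" when the query says "cat", which is one reason many of the listed papers study dense or learned retrievers instead.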


Papers

Showing 1-50 of 4740 papers

| Title | Status | Hype |
| --- | --- | --- |
| LightRAG: Simple and Fast Retrieval-Augmented Generation | Code | 14 |
| Language agents achieve superhuman synthesis of scientific knowledge | Code | 9 |
| MindSearch: Mimicking Human Minds Elicits Deep AI Searcher | Code | 9 |
| PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods | Code | 7 |
| Benchmarking the Myopic Trap: Positional Bias in Information Retrieval | Code | 5 |
| Make Your LLM Fully Utilize the Context | Code | 5 |
| Retrieval-Augmented Generation for AI-Generated Content: A Survey | Code | 5 |
| Extreme Compression of Large Language Models via Additive Quantization | Code | 5 |
| From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents | Code | 4 |
| DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents | Code | 4 |
| SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis | Code | 4 |
| Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language Models | Code | 4 |
| DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning | Code | 4 |
| Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation | Code | 4 |
| iText2KG: Incremental Knowledge Graphs Construction Using Large Language Models | Code | 4 |
| Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Code | 4 |
| COS-Mix: Cosine Similarity and Distance Fusion for Improved Information Retrieval | Code | 4 |
| Benchmarking Retrieval-Augmented Generation for Medicine | Code | 4 |
| Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard | Code | 4 |
| AlignScore: Evaluating Factual Consistency with a Unified Alignment Function | Code | 4 |
| AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators | Code | 4 |
| ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Code | 4 |
| SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes | Code | 4 |
| One Embedder, Any Task: Instruction-Finetuned Text Embeddings | Code | 4 |
| MTEB: Massive Text Embedding Benchmark | Code | 4 |
| PLAID: An Efficient Engine for Late Interaction Retrieval | Code | 4 |
| A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications | Code | 3 |
| Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers | Code | 3 |
| Distance Adaptive Beam Search for Provably Accurate Graph-Based Nearest Neighbor Search | Code | 3 |
| ReasonIR: Training Retrievers for Reasoning Tasks | Code | 3 |
| REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites | Code | 3 |
| Any Information Is Just Worth One Single Screenshot: Unifying Search With Visualized Information Retrieval | Code | 3 |
| BMX: Entropy-weighted Similarity and Semantic-enhanced Lexical Search | Code | 3 |
| Music2Latent: Consistency Autoencoders for Latent Audio Compression | Code | 3 |
| Robust Neural Information Retrieval: An Adversarial and Out-of-distribution Perspective | Code | 3 |
| MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels | Code | 3 |
| From Matching to Generation: A Survey on Generative Information Retrieval | Code | 3 |
| RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Code | 3 |
| When Large Language Models Meet Vector Databases: A Survey | Code | 3 |
| INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning | Code | 3 |
| WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia | Code | 3 |
| Dataset and Baseline System for Multi-lingual Extraction and Normalization of Temporal and Numerical Expressions | Code | 3 |
| FinBERT-QA: Financial Question Answering with pre-trained BERT Language Models | Code | 2 |
| Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey | Code | 2 |
| GENIUS: A Generative Framework for Universal Multimodal Search | Code | 2 |
| UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis | Code | 2 |
| A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Code | 2 |
| SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking | Code | 2 |
| Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions | Code | 2 |
| Rank1: Test-Time Compute for Reranking in Information Retrieval | Code | 2 |

Benchmark Results

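| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Two-tower Bi-Encoder (RoBERTa) | Recall@100 | 74.78 | | Unverified |
| 2 | Siamese Bi-Encoder (RoBERTa) | Recall@100 | 71.63 | | Unverified |
| 3 | BM25 | Recall@100 | 51.33 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RetroMAE v2 | MRR@10 | 42.58 | | Unverified |
| 2 | ConAE-256 | Time (ms) | 0.33 | | Unverified |
| 3 | ConAE-128 | Time (ms) | 0.32 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SGPT-BE-5.8B | mAP@100 | 0.16 | | Unverified |
| 2 | TSDAE | mAP@100 | 0.15 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | hpipubcommon | infNDCG | 0.56 | | Unverified |
| 2 | hpictall | infNDCG | 0.55 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MIND | HR@30 | 0.32 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Distilled Network | nDCG@10 | 0.53 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | RetroMAE | MRR@10 | 0.42 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SGPT-5.8B-msmarco | nDCG@10 | 50.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Information Retrieval + SVM | 1:1 Accuracy | 83.79 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | BERT+CONCEPT FILTER | NDCG | 0.25 | | Unverified |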
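
The benchmark results use cutoff metrics such as Recall@k, MRR@k, and nDCG@k without defining them. The sketch below shows one common way to compute each for a single query. The function names and the dictionary-of-gains input are assumptions for illustration, and individual benchmarks may use different variants, for example a different gain function or a 0-100 scale instead of 0-1.

```python
import math


def recall_at_k(ranked, relevant, k):
    """Fraction of the relevant documents that appear in the top k results."""
    if not relevant:
        return 0.0
    hits = sum(1 for doc_id in ranked[:k] if doc_id in relevant)
    return hits / len(relevant)


def reciprocal_rank_at_k(ranked, relevant, k):
    """1 / rank of the first relevant result in the top k, else 0.

    MRR@k is the mean of this value over all queries.
    """
    for position, doc_id in enumerate(ranked[:k], start=1):
        if doc_id in relevant:
            return 1.0 / position
    return 0.0


def ndcg_at_k(ranked, gains, k):
    """Normalized discounted cumulative gain with linear gains.

    `gains` maps doc_id -> graded relevance; missing ids count as 0.
    """
    dcg = sum(
        gains.get(doc_id, 0.0) / math.log2(position + 1)
        for position, doc_id in enumerate(ranked[:k], start=1)
    )
    ideal = sorted(gains.values(), reverse=True)[:k]
    idcg = sum(g / math.log2(position + 1) for position, g in enumerate(ideal, start=1))
    return dcg / idcg if idcg > 0 else 0.0


if __name__ == "__main__":
    ranked = ["d3", "d1", "d2"]          # system output, best first
    relevant = {"d1", "d4"}              # binary judgments
    gains = {"d1": 1.0, "d4": 1.0}       # same judgments as graded gains
    print(recall_at_k(ranked, relevant, 2))
    print(reciprocal_rank_at_k(ranked, relevant, 10))
    print(ndcg_at_k(ranked, gains, 3))
```

A reported score is normally the average of the per-query value over every query in the benchmark.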