SOTAVerified

Information Retrieval

Information retrieval is the task of ranking a list of documents or search results in response to a query.

(Image credit: sudhanshumittal)

Papers

Showing 501-550 of 4740 papers

Title | Status | Hype
Context manipulation attacks: Web agents are susceptible to corrupted memory |  | 0
FormGym: Doing Paperwork with Agents |  | 0
InsertRank: LLMs can reason over BM25 scores to Improve Listwise Reranking |  | 0
Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription |  | 0
xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations |  | 0
Hierarchical Multi-Positive Contrastive Learning for Patent Image Retrieval |  | 0
Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the Torus | Code | 0
eLog analysis for accelerators: status and future outlook |  | 0
CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following |  | 0
Towards Understanding Bias in Synthetic Data for Evaluation | Code | 0
ScholarSearch: Benchmarking Scholar Searching Ability of LLMs |  | 0
Private Aggregation for Byzantine-Resilient Heterogeneous Federated Learning |  | 0
Evaluation empirique de la sécurisation et de l'alignement de ChatGPT et Gemini: analyse comparative des vulnérabilités par expérimentations de jailbreaks |  | 0
Multimodal Representation Alignment for Cross-modal Information Retrieval |  | 0
Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data |  | 0
Unable to Forget: Proactive Interference Reveals Working Memory Limits in LLMs Beyond Context Length |  | 0
PolitiSky24: U.S. Political Bluesky Dataset with User Stance Labels |  | 0
SAR2Struct: Extracting 3D Semantic Structural Representation of Aircraft Targets from Single-View SAR Image |  | 0
BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions |  | 0
Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion | Code | 0
APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval |  | 0
A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions |  | 0
GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval | Code | 0
Preface to the Special Issue of the TAL Journal on Scholarly Document Processing |  | 0
Evaluating the Unseen Capabilities: How Many Theorems Do LLMs Know? |  | 0
CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval |  | 0
CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization |  | 0
Retrieval Augmented Generation based Large Language Models for Causality Mining | Code | 0
SNS-Bench-VL: Benchmarking Multimodal Large Language Models in Social Networking Services | Code | 0
From Token to Action: State Machine Reasoning to Mitigate Overthinking in Information Retrieval | Code | 0
Map&Make: Schema Guided Text to Table Generation |  | 0
Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation | Code | 0
Logical Consistency is Vital: Neural-Symbolic Information Retrieval for Negative-Constraint Queries | Code | 0
Rethinking Chunk Size For Long-Document Retrieval: A Multi-Dataset Analysis | Code | 0
Anveshana: A New Benchmark Dataset for Cross-Lingual Information Retrieval On English Queries and Sanskrit Documents |  | 0
It's High Time: A Survey of Temporal Information Retrieval and Question Answering |  | 0
DocMMIR: A Framework for Document Multi-modal Information Retrieval | Code | 0
Likert or Not: LLM Absolute Relevance Judgments on Fine-Grained Ordinal Scales |  | 0
DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research |  | 0
InstructPart: Task-Oriented Part Segmentation with Instruction Reasoning |  | 0
Intent Classification on Low-Resource Languages with Query Similarity Search |  | 0
Reinforcement Speculative Decoding for Fast Ranking |  | 0
Don't "Overthink" Passage Reranking: Is Reasoning Truly Necessary? |  | 0
Tools in the Loop: Quantifying Uncertainty of LLM Question Answering Systems That Use Tools |  | 0
Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation |  | 0
Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval |  | 0
Search Wisely: Mitigating Sub-optimal Agentic Searches By Reducing Uncertainty |  | 0
Layer-wise Investigation of Large-Scale Self-Supervised Music Representation Models |  | 0
MiLQ: Benchmarking IR Models for Bilingual Web Search with Mixed Language Queries |  | 0
Learning Normal Patterns in Musical Loops |  | 0
Page 11 of 95

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Two-tower Bi-Encoder (RoBERTa) | Recall@100 | 74.78 |  | Unverified
2 | Siamese Bi-Encoder (RoBERTa) | Recall@100 | 71.63 |  | Unverified
3 | BM25 | Recall@100 | 51.33 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RetroMAE v2 | MRR@10 | 42.58 |  | Unverified
2 | ConAE-256 | Time (ms) | 0.33 |  | Unverified
3 | ConAE-128 | Time (ms) | 0.32 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SGPT-BE-5.8B | mAP@100 | 0.16 |  | Unverified
2 | TSDAE | mAP@100 | 0.15 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | hpipubcommon | infNDCG | 0.56 |  | Unverified
2 | hpictall | infNDCG | 0.55 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MIND | HR@30 | 0.32 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Distilled Network | nDCG@10 | 0.53 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RetroMAE | MRR@10 | 0.42 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SGPT-5.8B-msmarco | nDCG@10 | 50.25 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Information Retrieval + SVM | 1:1 Accuracy | 83.79 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BERT+CONCEPT FILTER | NDCG | 0.25 |  | Unverified