SOTAVerified

Information Retrieval

Information retrieval is the task of ranking a list of documents or search results in response to a query.
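As a rough illustration of the definition above, the sketch below ranks a handful of documents against a query. The term-overlap `score` function and the sample documents are assumptions made for illustration only; they do not correspond to any model listed on this page.

```python
from collections import Counter


def score(query: str, doc: str) -> int:
    """Toy relevance score: occurrences of distinct query terms in the document."""
    doc_terms = Counter(doc.lower().split())
    return sum(doc_terms[term] for term in set(query.lower().split()))


def rank(query: str, docs: list[str]) -> list[str]:
    """Return the documents sorted by descending score (ties keep input order)."""
    return sorted(docs, key=lambda d: score(query, d), reverse=True)


# Hypothetical mini-collection.
docs = [
    "cats chase mice",
    "neural ranking models for document retrieval",
    "document retrieval with sparse term matching",
]
ranked = rank("document retrieval models", docs)
for position, doc in enumerate(ranked, start=1):
    print(position, doc)
```

Real systems replace the toy score with a learned or statistical relevance function, but the output has the same shape: an ordered list of documents per query.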


Papers

Showing 51-100 of 4740 papers

Title | Status | Hype
The GigaMIDI Dataset with Features for Expressive Music Performance Detection | Code | 2
VUS: Effective and Efficient Accuracy Measures for Time-Series Anomaly Detection | Code | 2
GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity? | Code | 2
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark | Code | 2
LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant | Code | 2
Lightning IR: Straightforward Fine-tuning and Inference of Transformer-based Language Models for Information Retrieval | Code | 2
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning | Code | 2
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models | Code | 2
Differential Transformer | Code | 2
Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval | Code | 2
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models | Code | 2
Eureka: Evaluating and Understanding Large Foundation Models | Code | 2
Language Model Powered Digital Biology with BRAD | Code | 2
MemLong: Memory-Augmented Retrieval for Long Text Modeling | Code | 2
Scientific QA System with Verifiable Answers | Code | 2
Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation | Code | 2
PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents | Code | 2
CoIR: A Comprehensive Benchmark for Code Information Retrieval Models | Code | 2
FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Code | 2
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models | Code | 2
MidiCaps: A large-scale MIDI dataset with text captions | Code | 2
Evaluation of Retrieval-Augmented Generation: A Survey | Code | 2
Overview of the EHRSQL 2024 Shared Task on Reliable Text-to-SQL Modeling on Electronic Health Records | Code | 2
Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era | Code | 2
AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval | Code | 2
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions | Code | 2
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers | Code | 2
Backtracing: Retrieving the Cause of the Query | Code | 2
Verif.ai: Towards an Open-Source Scientific Generative Question-Answering System with Referenced and Verifiable Answers | Code | 2
The Power of Noise: Redefining Retrieval for RAG Systems | Code | 2
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure Analysis | Code | 2
BIRB: A Generalization Benchmark for Information Retrieval in Bioacoustics | Code | 2
RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze! | Code | 2
SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models | Code | 2
Mustango: Toward Controllable Text-to-Music Generation | Code | 2
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions | Code | 2
A Foundation Model for Music Informatics | Code | 2
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models | Code | 2
CAGRA: Highly Parallel Graph Construction and Approximate Nearest Neighbor Search for GPUs | Code | 2
Large Language Models for Information Retrieval: A Survey | Code | 2
All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio | Code | 2
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Code | 2
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Code | 2
RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Code | 2
WebCPM: Interactive Web Search for Chinese Long-form Question Answering | Code | 2
Autonomous GIS: the next-generation AI-powered GIS | Code | 2
RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Code | 2
Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents | Code | 2
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers | Code | 2
InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Two-tower Bi-Encoder (RoBERTa) | Recall@100 | 74.78 | | Unverified
2 | Siamese Bi-Encoder (RoBERTa) | Recall@100 | 71.63 | | Unverified
3 | BM25 | Recall@100 | 51.33 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RetroMAE v2 | MRR@10 | 42.58 | | Unverified
2 | ConAE-256 | Time (ms) | 0.33 | | Unverified
3 | ConAE-128 | Time (ms) | 0.32 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SGPT-BE-5.8B | mAP@100 | 0.16 | | Unverified
2 | TSDAE | mAP@100 | 0.15 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | hpipubcommon | infNDCG | 0.56 | | Unverified
2 | hpictall | infNDCG | 0.55 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MIND | HR@30 | 0.32 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Distilled Network | nDCG@10 | 0.53 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RetroMAE | MRR@10 | 0.42 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SGPT-5.8B-msmarco | nDCG@10 | 50.25 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Information Retrieval + SVM | 1:1 Accuracy | 83.79 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BERT+CONCEPT FILTER | NDCG | 0.25 | | Unverified
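
Several of the metrics reported above (Recall@k, MRR@k, nDCG@k) are standard ranking measures. The sketch below shows one common way to compute them for a single query, assuming binary relevance judgments; the function names and the sample ranking are hypothetical and are not taken from any listed benchmark. Leaderboard figures are typically averages over many queries, and the tables above appear to mix a 0-1 scale with a 0-100 scale, so values are not directly comparable across tables.

```python
import math


def recall_at_k(ranked: list[str], relevant: set[str], k: int) -> float:
    """Fraction of the relevant documents that appear in the top k results."""
    if not relevant:
        return 0.0
    return len(set(ranked[:k]) & relevant) / len(relevant)


def mrr_at_k(ranked: list[str], relevant: set[str], k: int) -> float:
    """Reciprocal rank of the first relevant document within the top k, else 0."""
    for position, doc_id in enumerate(ranked[:k], start=1):
        if doc_id in relevant:
            return 1.0 / position
    return 0.0


def ndcg_at_k(ranked: list[str], relevant: set[str], k: int) -> float:
    """nDCG with binary gains: DCG of the ranking divided by the ideal DCG."""
    dcg = sum(
        1.0 / math.log2(position + 1)
        for position, doc_id in enumerate(ranked[:k], start=1)
        if doc_id in relevant
    )
    ideal = sum(
        1.0 / math.log2(position + 1)
        for position in range(1, min(len(relevant), k) + 1)
    )
    return dcg / ideal if ideal else 0.0


# Hypothetical single-query example: document ids in ranked order.
ranked = ["d3", "d1", "d7", "d2"]
relevant = {"d1", "d2"}

print(recall_at_k(ranked, relevant, 3))
print(mrr_at_k(ranked, relevant, 3))
print(ndcg_at_k(ranked, relevant, 4))
```

Multiplying each per-query value by 100 and averaging over the query set gives figures on the percentage scale used in some of the tables.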