SOTAVerified

Information Retrieval

Information retrieval is the task of ranking a list of documents or search results in response to a query

( Image credit: sudhanshumittal )

Papers

Showing 101150 of 4740 papers

TitleStatusHype
A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information RetrievalCode2
GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?Code2
Detect-Order-Construct: A Tree Construction based Approach for Hierarchical Document Structure AnalysisCode2
GiantMIDI-Piano: A large-scale MIDI dataset for classical piano musicCode2
The GigaMIDI Dataset with Features for Expressive Music Performance DetectionCode2
The Power of Noise: Redefining Retrieval for RAG SystemsCode2
Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph CompletionCode2
FreshDiskANN: A Fast and Accurate Graph-Based ANN Index for Streaming Similarity SearchCode2
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of RerankersCode2
Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM EraCode2
AIR-Bench: Automated Heterogeneous Information Retrieval BenchmarkCode2
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based RetrieversCode2
Infinite Recommendation Networks: A Data-Centric ApproachCode2
FinBERT-QA: Financial Question Answering with pre-trained BERT Language ModelsCode2
BIRB: A Generalization Benchmark for Information Retrieval in BioacousticsCode2
FIRST: Faster Improved Listwise Reranking with Single Token DecodingCode2
Eureka: Evaluating and Understanding Large Foundation ModelsCode2
A Foundation Model for Music InformaticsCode2
Evaluation of Retrieval-Augmented Generation: A SurveyCode2
ColBERT: Efficient and Effective Passage Search via Contextualized Late Interaction over BERTCode2
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow InstructionsCode2
Autoregressive Search Engines: Generating Substrings as Document IdentifiersCode2
InPars: Data Augmentation for Information Retrieval using Large Language ModelsCode2
Backtracing: Retrieving the Cause of the QueryCode2
Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor SearchCode1
EEG2Mel: Reconstructing Sound from Brain Responses to MusicCode1
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and BetterCode1
A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablature Transcription SystemsCode1
RACE: Retrieval-Augmented Commit Message GenerationCode1
Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp)Code1
DTF-AT: Decoupled Time-Frequency Audio Transformer for Event ClassificationCode1
Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword ExtractionCode1
Automatic Jailbreaking of the Text-to-Image Generative AI SystemsCode1
Do Music Generation Models Encode Music Theory?Code1
Dynamic Modality Interaction Modeling for Image-Text RetrievalCode1
Efficiently predicting high resolution mass spectra with graph neural networksCode1
Adaptive Machine Translation with Large Language ModelsCode1
Distilling Knowledge from Reader to Retriever for Question AnsweringCode1
Dive into Decision Trees and Forests: A Theoretical DemonstrationCode1
Automatic Generation of Topic LabelsCode1
Distillation and Refinement of Reasoning in Small Language Models for Document Re-rankingCode1
Divide and Conquer: Text Semantic Matching with Disentangled Keywords and IntentsCode1
Augmenting Document Representations for Dense Retrieval with Interpolation and PerturbationCode1
audioLIME: Listenable Explanations Using Source SeparationCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
Audio Embeddings as Teachers for Music ClassificationCode1
Dimension Reduction for Efficient Dense Retrieval via Conditional AutoencoderCode1
Discovering Mathematical Objects of Interest -- A Study of Mathematical NotationsCode1
Injecting Domain Adaptation with Learning-to-hash for Effective and Efficient Zero-shot Dense RetrievalCode1
Embed2Detect: Temporally Clustered Embedded Words for Event Detection in Social MediaCode1
Show:102550
← PrevPage 3 of 95Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Two-tower Bi-Encoder (RoBERTa)Recall@10074.78Unverified
2Siamese Bi-Encoder (RoBERTa)Recall@10071.63Unverified
3BM25Recall@10051.33Unverified
#ModelMetricClaimedVerifiedStatus
1RetroMAE v2MRR@1042.58Unverified
2ConAE-256Time (ms)0.33Unverified
3ConAE-128Time (ms)0.32Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BmAP@1000.16Unverified
2TSDAEmAP@1000.15Unverified
#ModelMetricClaimedVerifiedStatus
1hpipubcommoninfNDCG0.56Unverified
2hpictallinfNDCG0.55Unverified
#ModelMetricClaimedVerifiedStatus
1MINDHR@300.32Unverified
#ModelMetricClaimedVerifiedStatus
1Distilled NetworknDCG@100.53Unverified
#ModelMetricClaimedVerifiedStatus
1RetroMAEMRR@100.42Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-5.8B-msmarconDCG@1050.25Unverified
#ModelMetricClaimedVerifiedStatus
1Information Retrieval + SVM1:1 Accuracy83.79Unverified
#ModelMetricClaimedVerifiedStatus
1BERT+CONCEPT FILTERNDCG0.25Unverified