SOTAVerified

Information Retrieval

Information retrieval is the task of ranking a list of documents or search results in response to a query

( Image credit: sudhanshumittal )

Papers

Showing 201250 of 4740 papers

TitleStatusHype
TriviaHG: A Dataset for Automatic Hint Generation from Factoid QuestionsCode1
RankMamba: Benchmarking Mamba's Document Ranking Performance in the Era of TransformersCode1
DTF-AT: Decoupled Time-Frequency Audio Transformer for Event ClassificationCode1
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge AugmentationCode1
Hierarchical Multimodal Pre-training for Visually Rich Webpage UnderstandingCode1
Corpus-Steered Query Expansion with Large Language ModelsCode1
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a SurveyCode1
ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information RetrievalCode1
Self-Retrieval: End-to-End Information Retrieval with One Large Language ModelCode1
INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval ModelsCode1
MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of MusicCode1
ExaRanker-Open: Synthetic Explanation for IR using Open-Source LLMsCode1
The Quantified Boolean Bayesian Network: Theory and Experiments with a Logical Graphical ModelCode1
Enhancing Complex Question Answering over Knowledge Graphs through Evidence Pattern RetrievalCode1
History-Aware Conversational Dense RetrievalCode1
LongHealth: A Question Answering Benchmark with Long Clinical DocumentsCode1
SciMMIR: Benchmarking Scientific Multi-modal Information RetrievalCode1
Exploring the Best Practices of Query Expansion with Large Language ModelsCode1
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional SportsCode1
CaseGNN: Graph Neural Networks for Legal Case Retrieval with Text-Attributed GraphsCode1
Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc RetrievalCode1
Extending Context Window of Large Language Models via Semantic CompressionCode1
MUST: An Effective and Scalable Framework for Multimodal Search of Target ModalityCode1
LLF-Bench: Benchmark for Interactive Learning from Language FeedbackCode1
mir_ref: A Representation Evaluation Framework for Music Information Retrieval TasksCode1
ESPN: Memory-Efficient Multi-Vector Information RetrievalCode1
A Two-Stage Adaptation of Large Language Models for Text RankingCode1
IterCQR: Iterative Conversational Query Reformulation with Retrieval GuidanceCode1
Scalable and Effective Generative Information RetrievalCode1
Neural Retrievers are Biased Towards LLM-Generated ContentCode1
Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long DocumentsCode1
Poisoning Retrieval Corpora by Injecting Adversarial PassagesCode1
Attention Lens: A Tool for Mechanistically Interpreting the Attention Head Information Retrieval MechanismCode1
A Comprehensive Python Library for Deep Learning-Based Event Detection in Multivariate Time Series Data and Information Retrieval in NLPCode1
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific LiteratureCode1
BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational BioimagingCode1
Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document RankingCode1
A Comprehensive Evaluation of Large Language Models on Legal Judgment PredictionCode1
Leveraging Large Language Models for Node Generation in Few-Shot Learning on Text-Attributed GraphsCode1
Language Models As Semantic IndexersCode1
Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge GeneratorsCode1
LAiW: A Chinese Legal Large Language Models BenchmarkCode1
EMelodyGen: Emotion-Conditioned Melody Generation in ABC Notation with the Musical Feature TemplateCode1
Symbolic Music Representations for Classification Tasks: A Systematic EvaluationCode1
YAGO 4.5: A Large and Clean Knowledge Base with a Rich TaxonomyCode1
RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language ModelsCode1
Taken by Surprise: Contrast effect for Similarity ScoresCode1
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with AttributionCode1
Med-HALT: Medical Domain Hallucination Test for Large Language ModelsCode1
Zero-note samba: Self-supervised beat trackingCode1
Show:102550
← PrevPage 5 of 95Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Two-tower Bi-Encoder (RoBERTa)Recall@10074.78Unverified
2Siamese Bi-Encoder (RoBERTa)Recall@10071.63Unverified
3BM25Recall@10051.33Unverified
#ModelMetricClaimedVerifiedStatus
1RetroMAE v2MRR@1042.58Unverified
2ConAE-256Time (ms)0.33Unverified
3ConAE-128Time (ms)0.32Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-BE-5.8BmAP@1000.16Unverified
2TSDAEmAP@1000.15Unverified
#ModelMetricClaimedVerifiedStatus
1hpipubcommoninfNDCG0.56Unverified
2hpictallinfNDCG0.55Unverified
#ModelMetricClaimedVerifiedStatus
1MINDHR@300.32Unverified
#ModelMetricClaimedVerifiedStatus
1Distilled NetworknDCG@100.53Unverified
#ModelMetricClaimedVerifiedStatus
1RetroMAEMRR@100.42Unverified
#ModelMetricClaimedVerifiedStatus
1SGPT-5.8B-msmarconDCG@1050.25Unverified
#ModelMetricClaimedVerifiedStatus
1Information Retrieval + SVM1:1 Accuracy83.79Unverified
#ModelMetricClaimedVerifiedStatus
1BERT+CONCEPT FILTERNDCG0.25Unverified