SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 10261050 of 14297 papers

TitleStatusHype
Dual Operating Modes of In-Context LearningCode1
Hierarchical Multimodal Pre-training for Visually Rich Webpage UnderstandingCode1
Corpus-Steered Query Expansion with Large Language ModelsCode1
StaPep: an open-source tool for the structure prediction and feature extraction of hydrocarbon-stapled peptidesCode1
RNNs are not Transformers (Yet): The Key Bottleneck on In-context RetrievalCode1
WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World ScenarioCode1
Grounding Language Models for Visual Entity RecognitionCode1
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation SystemsCode1
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question AnsweringCode1
Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a SurveyCode1
Multimodal Learned Sparse Retrieval with Probabilistic Expansion ControlCode1
Citation-Enhanced Generation for LLM-based ChatbotsCode1
RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health RecordsCode1
ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot RetrievalCode1
Self-Retrieval: End-to-End Information Retrieval with One Large Language ModelCode1
ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information RetrievalCode1
OpenTab: Advancing Large Language Models as Open-domain Table ReasonersCode1
INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval ModelsCode1
CLoVe: Encoding Compositional Language in Contrastive Vision-Language ModelsCode1
LLM Based Multi-Agent Generation of Semi-structured Documents from Semantic Templates in the Public Administration DomainCode1
Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement LearningCode1
Analysing The Impact of Sequence Composition on Language Model Pre-TrainingCode1
Understanding and Mitigating the Threat of Vec2Text to Dense Retrieval SystemsCode1
Heterogeneous Graph Reasoning for Fact Checking over Texts and TablesCode1
Interpreting Conversational Dense Retrieval by Rewriting-Enhanced Inversion of Session EmbeddingCode1
Show:102550
← PrevPage 42 of 572Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified