SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.
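As a rough illustration of the definition above, the sketch below ranks a small corpus against a query and keeps the top-k documents as context. It is a minimal example, not the method of any paper listed here: the bag-of-words cosine scoring, the toy corpus, and the helper names (`tokenize`, `cosine`, `retrieve`) are all assumptions chosen for brevity.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercase whitespace tokenization; a real system would use a proper tokenizer.
    return text.lower().split()


def cosine(a, b):
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=2):
    # Score every document against the query and return the k best (index, score) pairs.
    q = Counter(tokenize(query))
    scored = [(i, cosine(q, Counter(tokenize(doc)))) for i, doc in enumerate(corpus)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy corpus, for illustration only.
corpus = [
    "retrieval augmented generation adds retrieved passages to the prompt",
    "state space models for long context language modeling",
    "text to image generation with diffusion models",
]
top = retrieve("retrieval augmented generation", corpus, k=2)
# The retrieved documents would then be supplied to a model as extra context.
context = [corpus[i] for i, _ in top]
```

In a retrieval-augmented generation or in-context learning setup, `context` would typically be prepended to the model input; production systems usually replace the exhaustive scoring loop with an inverted index or an approximate nearest-neighbor index.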

Papers

Showing 3051-3100 of 14297 papers

Title | Status | Hype
Needle In A Multimodal Haystack | Code | 2
Large Language Models are Limited in Out-of-Context Knowledge Reasoning | Code | 0
RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Code | 2
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph | - | 0
Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Code | 0
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Code | 4
MINERS: Multilingual Language Models as Semantic Retrievers | Code | 1
Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document Comprehension: Task, Insights, and Challenges | - | 0
TelecomRAG: Taming Telecom Standards with Retrieval Augmented Generation and LLMs | - | 0
Which Country Is This? Automatic Country Ranking of Street View Photos | Code | 0
Scholarly Question Answering using Large Language Models in the NFDI4DataScience Gateway | Code | 0
Text Information Retrieval in Tetun: A Preliminary Study | - | 0
Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval | - | 0
Progressive Query Expansion for Retrieval Over Cost-constrained Data Sources | - | 0
The Impact of Quantization on Retrieval-Augmented Generation: An Analysis of Small LLMs | - | 0
AutoSurvey: Large Language Models Can Automatically Write Surveys | Code | 3
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor | Code | 2
Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval | - | 0
Evaluating the Retrieval Component in LLM-Based Question Answering Systems | - | 0
Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval | Code | 0
Weighted KL-Divergence for Document Ranking Model Refinement | - | 0
Recurrent Context Compression: Efficiently Expanding the Context Window of LLM | Code | 2
2DP-2MRC: 2-Dimensional Pointer-based Machine Reading Comprehension Method for Multimodal Moment Retrieval | - | 0
Enhancing Long-Term Memory using Hierarchical Aggregate Tree for Retrieval Augmented Generation | - | 0
GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge? | Code | 0
MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model | - | 0
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Code | 1
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval | - | 0
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue | Code | 2
TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models | - | 0
Async Learned User Embeddings for Ads Delivery Optimization | - | 0
Machine Against the RAG: Jamming Retrieval-Augmented Generation with Blocker Documents | - | 0
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Code | 0
Retrieval & Fine-Tuning for In-Context Tabular Models | - | 0
Corpus Poisoning via Approximate Greedy Gradient Descent | Code | 0
The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More | - | 0
The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Code | 0
Diving Deep into the Motion Representation of Video-Text Models | Code | 0
Multi-Head RAG: Solving Multi-Aspect Problems with LLMs | Code | 3
PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction | Code | 0
MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources | - | 0
User Intent Recognition and Semantic Cache Optimization-Based Query Processing Framework using CFLIS and MGR-LAU | - | 0
DiffuSyn Bench: Evaluating Vision-Language Models on Real-World Complexities with Diffusion-Generated Synthetic Benchmarks | - | 0
Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation | - | 0
On The Importance of Reasoning for Context Retrieval in Repository-Level Code Editing | Code | 1
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe | Code | 0
A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential | - | 0
Reducing the climate impact of data portals: a case study | - | 0
Measuring and Addressing Indexical Bias in Information Retrieval | Code | 0
VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval | - | 0
Page 62 of 286

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | BM25S | Queries per second | 183.53 | - | Unverified
2 | Elasticsearch | Queries per second | 21.8 | - | Unverified
3 | BM25-PT | Queries per second | 6.49 | - | Unverified
4 | Rank-BM25 | Queries per second | 1.18 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BM25S | Queries per second | 20.88 | - | Unverified
2 | Elasticsearch | Queries per second | 7.11 | - | Unverified
3 | Rank-BM25 | Queries per second | 0.04 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BM25S | Queries per second | 41.85 | - | Unverified
2 | Elasticsearch | Queries per second | 12.16 | - | Unverified
3 | Rank-BM25 | Queries per second | 0.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FLMR | Recall@5 | 89.32 | - | Unverified
2 | RA-VQA | Recall@5 | 82.84 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PreFLMR | Recall@5 | 62.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CLIP-KIS | text-to-video Mean Rank | 30 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CLIP4Outfit | Recall@5 | 7.59 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | - | Unverified

#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified