SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 51100 of 14297 papers

TitleStatusHype
Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge TasksCode5
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive RetrievalCode5
Make Your LLM Fully Utilize the ContextCode5
MiniRAG: Towards Extremely Simple Retrieval-Augmented GenerationCode5
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and InferenceCode5
TrustRAG: An Information Assistant with Retrieval Augmented GenerationCode5
ImageBind: One Embedding Space To Bind Them AllCode5
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain KnowledgeCode4
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attentionCode4
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-ReflectionCode4
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language ModelingCode4
s3: You Don't Need That Much Data to Train a Search Agent via RLCode4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual RepresentationCode4
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question AnsweringCode4
RETSim: Resilient and Efficient Text SimilarityCode4
SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory SynthesisCode4
Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMACode4
Retrieval-Augmented Generation for Knowledge-Intensive NLP TasksCode4
Retrieval-Augmented Generation for Large Language Models: A SurveyCode4
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up QuestionsCode4
Retrieval-Augmented Generation with Hierarchical KnowledgeCode4
Resources for Brewing BEIR: Reproducible Reference Models and an Official LeaderboardCode4
AlignScore: Evaluating Factual Consistency with a Unified Alignment FunctionCode4
Long-CLIP: Unlocking the Long-Text Capability of CLIPCode4
Retrieval-Generation Synergy Augmented Large Language ModelsCode4
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted IndexesCode4
Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language ModelsCode4
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question AnsweringCode4
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented GenerationCode4
Benchmarking Retrieval-Augmented Generation for MedicineCode4
Goldfish: Vision-Language Understanding of Arbitrarily Long VideosCode4
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt OptimizationCode4
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement LearningCode4
From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning AgentsCode4
Gated Delta Networks: Improving Mamba2 with Delta RuleCode4
PLAID: An Efficient Engine for Late Interaction RetrievalCode4
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality TeachersCode4
Halu-J: Critique-Based Hallucination JudgeCode4
Generative Representational Instruction TuningCode4
OnPrem.LLM: A Privacy-Conscious Document Intelligence ToolkitCode4
Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text RetrieversCode4
Prompt2Model: Generating Deployable Models from Natural Language InstructionsCode4
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process RewardingCode4
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image RetrievalCode4
Multimodal Whole Slide Foundation Model for PathologyCode4
A Survey of LLM DATACode4
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming HeadsCode4
EasyRAG: Efficient Retrieval-Augmented Generation Framework for Automated Network OperationsCode4
MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental LearningCode4
DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement LearningCode4
Show:102550
← PrevPage 2 of 286Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified