SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 476500 of 14297 papers

TitleStatusHype
Lost in the Middle: How Language Models Use Long ContextsCode2
MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information RetrievalCode2
LeanDojo: Theorem Proving with Retrieval-Augmented Language ModelsCode2
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote SensingCode2
RemoteCLIP: A Vision Language Foundation Model for Remote SensingCode2
RETA-LLM: A Retrieval-Augmented Large Language Model ToolkitCode2
MolFM: A Multimodal Molecular Foundation ModelCode2
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion PriorsCode2
Generating Images with Multimodal Language ModelsCode2
Enabling Large Language Models to Generate Text with CitationsCode2
Lawyer LLaMA Technical ReportCode2
Adapting Language Models to Compress ContextsCode2
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text GenerationCode2
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language ModelsCode2
Knowledge-Design: Pushing the Limit of Protein Design via Knowledge RefinementCode2
WebCPM: Interactive Web Search for Chinese Long-form Question AnsweringCode2
Active Retrieval Augmented GenerationCode2
Autonomous GIS: the next-generation AI-powered GISCode2
Recommender Systems with Generative RetrievalCode2
RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language ModelsCode2
TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion SynthesisCode2
Huatuo-26M, a Large-scale Chinese Medical QA DatasetCode2
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive ReviewCode2
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical InformationCode2
Scaling Transformer to 1M tokens and beyond with RMTCode2
Show:102550
← PrevPage 20 of 572Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified