SOTAVerified

Retrieval

Retrieval is the selection of relevant data or examples from a large dataset to support tasks such as prediction, learning, or inference. It supplies a model with additional context and is commonly used in systems such as retrieval-augmented generation and in-context learning.
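As a rough illustration of the selection step described above, the following minimal sketch ranks a toy corpus against a query and places the best match in a prompt. The bag-of-words scoring, the `retrieve` helper, and the corpus are all assumptions made for illustration; real systems typically use BM25 or learned embeddings instead.

```python
import math
from collections import Counter


def bow(text):
    """Lowercased bag-of-words term counts (a stand-in for a real encoder)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=2):
    """Return the k corpus entries most similar to the query, best first."""
    q = bow(query)
    ranked = sorted(corpus, key=lambda doc: cosine(q, bow(doc)), reverse=True)
    return ranked[:k]


# Hypothetical toy corpus, for illustration only.
corpus = [
    "BM25 is a lexical ranking function",
    "CLIP aligns images and text",
    "Dense retrievers embed queries and documents",
]
question = "how do dense retrievers embed queries"
context = retrieve(question, corpus, k=1)

# The retrieved text is prepended to the question, as in retrieval-augmented generation.
prompt = "Context: " + " ".join(context) + "\nQuestion: " + question
```

Here `context` holds the single passage that shares the most terms with the question, and `prompt` is what a downstream model would receive.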

Papers

Showing 451-500 of 14,297 papers

| Title | Status | Hype |
|---|---|---|
| Position Interpolation Improves ALiBi Extrapolation | Code | 2 |
| ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models | Code | 2 |
| Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity | Code | 2 |
| VeCLIP: Improving CLIP Training via Visual-enriched Captions | Code | 2 |
| Uni3D: Exploring Unified 3D Representation at Scale | Code | 2 |
| Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering | Code | 2 |
| RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models | Code | 2 |
| DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services | Code | 2 |
| Optimization of Rank Losses for Image Retrieval | Code | 2 |
| Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications | Code | 2 |
| Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning | Code | 2 |
| NLLB-CLIP -- train performant multilingual image retrieval model on a budget | Code | 2 |
| Benchmarking Large Language Models in Retrieval-Augmented Generation | Code | 2 |
| Knowledge Graph Prompting for Multi-Document Question Answering | Code | 2 |
| MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Code | 2 |
| Global Features are All You Need for Image Retrieval and Reranking | Code | 2 |
| Large Language Models for Information Retrieval: A Survey | Code | 2 |
| SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool | Code | 2 |
| The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World | Code | 2 |
| UniVTG: Towards Unified Video-Language Temporal Grounding | Code | 2 |
| All-In-One Metrical And Functional Structure Analysis With Neighborhood Attentions on Demixed Audio | Code | 2 |
| TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023 | Code | 2 |
| Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation | Code | 2 |
| Generating Benchmarks for Factuality Evaluation of Language Models | Code | 2 |
| InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval | Code | 2 |
| Lost in the Middle: How Language Models Use Long Contexts | Code | 2 |
| MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Code | 2 |
| LeanDojo: Theorem Proving with Retrieval-Augmented Language Models | Code | 2 |
| RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Code | 2 |
| RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Code | 2 |
| RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Code | 2 |
| MolFM: A Multimodal Molecular Foundation Model | Code | 2 |
| Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | Code | 2 |
| Generating Images with Multimodal Language Models | Code | 2 |
| Enabling Large Language Models to Generate Text with Citations | Code | 2 |
| Lawyer LLaMA Technical Report | Code | 2 |
| Adapting Language Models to Compress Contexts | Code | 2 |
| FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | Code | 2 |
| ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models | Code | 2 |
| Knowledge-Design: Pushing the Limit of Protein Design via Knowledge Refinement | Code | 2 |
| WebCPM: Interactive Web Search for Chinese Long-form Question Answering | Code | 2 |
| Active Retrieval Augmented Generation | Code | 2 |
| Autonomous GIS: the next-generation AI-powered GIS | Code | 2 |
| Recommender Systems with Generative Retrieval | Code | 2 |
| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Code | 2 |
| TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis | Code | 2 |
| Huatuo-26M, a Large-scale Chinese Medical QA Dataset | Code | 2 |
| Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review | Code | 2 |
| GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information | Code | 2 |
| Scaling Transformer to 1M tokens and beyond with RMT | Code | 2 |

Benchmark Results

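| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BM25S | Queries per second | 183.53 | | Unverified |
| 2 | Elasticsearch | Queries per second | 21.8 | | Unverified |
| 3 | BM25-PT | Queries per second | 6.49 | | Unverified |
| 4 | Rank-BM25 | Queries per second | 1.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BM25S | Queries per second | 20.88 | | Unverified |
| 2 | Elasticsearch | Queries per second | 7.11 | | Unverified |
| 3 | Rank-BM25 | Queries per second | 0.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BM25S | Queries per second | 41.85 | | Unverified |
| 2 | Elasticsearch | Queries per second | 12.16 | | Unverified |
| 3 | Rank-BM25 | Queries per second | 0.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FLMR | Recall@5 | 89.32 | | Unverified |
| 2 | RA-VQA | Recall@5 | 82.84 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PreFLMR | Recall@5 | 62.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CLIP-KIS | text-to-video Mean Rank | 30 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CLIP4Outfit | Recall@5 | 7.59 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | | Unverified |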

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | COLT | COMP@8 | 4.55 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | hello | 0L | 1,121,222 | | Unverified |