SOTAVerified

Retrieval

Retrieval is the task of selecting relevant data or examples from a large dataset to support tasks such as prediction, learning, or inference. It supplies models with context or additional information, and is often used in systems such as retrieval-augmented generation and in-context learning.

Papers

Showing 6326-6350 of 14297 papers

| Title | Status | Hype |
|---|---|---|
| CREPE: Can Vision-Language Foundation Models Reason Compositionally? | Code | 1 |
| Domain Adaptation for Dense Retrieval through Self-Supervision by Pseudo-Relevance Labeling | | 0 |
| Auto-labelling of Bug Report using Natural Language Processing | | 0 |
| LidarCLIP or: How I Learned to Talk to Point Clouds | Code | 1 |
| Predicting Knowledge Gain for MOOC Video Consumption | Code | 0 |
| Contextual Explainable Video Representation: Human Perception-based Understanding | Code | 0 |
| Scale-Semantic Joint Decoupling Network for Image-text Retrieval in Remote Sensing | | 0 |
| In Defense of Cross-Encoders for Zero-Shot Retrieval | Code | 1 |
| Changes in Power and Information Flow in Resting-state EEG by Working Memory Process | | 0 |
| The diagnostic utility of endocytoscopy for the detection of esophageal lesions: a systematic review and meta-analysis | | 0 |
| SEPT: Towards Scalable and Efficient Visual Pre-Training | | 0 |
| Using Multiple Instance Learning to Build Multimodal Representations | | 0 |
| Information retrieval in single cell chromatin analysis using TF-IDF transformation methods | | 0 |
| LEAD: Liberal Feature-based Distillation for Dense Retrieval | | 0 |
| Natural Logic-guided Autoregressive Multi-hop Document Retrieval for Fact Verification | | 0 |
| REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory | Code | 0 |
| A Comparison of Audio Preprocessing Techniques and Deep Learning Algorithms for Raga Recognition | | 0 |
| VindLU: A Recipe for Effective Video-and-Language Pretraining | Code | 1 |
| VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners | | 0 |
| DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset | Code | 1 |
| Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval | Code | 1 |
| Group Generalized Mean Pooling for Vision Transformer | | 0 |
| Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | | 0 |
| FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation | Code | 1 |
| Text Embeddings by Weakly-Supervised Contrastive Pre-training | | 0 |

Benchmark Results

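| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BM25S | Queries per second | 183.53 | | Unverified |
| 2 | Elasticsearch | Queries per second | 21.8 | | Unverified |
| 3 | BM25-PT | Queries per second | 6.49 | | Unverified |
| 4 | Rank-BM25 | Queries per second | 1.18 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BM25S | Queries per second | 20.88 | | Unverified |
| 2 | Elasticsearch | Queries per second | 7.11 | | Unverified |
| 3 | Rank-BM25 | Queries per second | 0.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BM25S | Queries per second | 41.85 | | Unverified |
| 2 | Elasticsearch | Queries per second | 12.16 | | Unverified |
| 3 | Rank-BM25 | Queries per second | 0.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FLMR | Recall@5 | 89.32 | | Unverified |
| 2 | RA-VQA | Recall@5 | 82.84 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PreFLMR | Recall@5 | 62.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CLIP-KIS | text-to-video Mean Rank | 30 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CLIP4Outfit | Recall@5 | 7.59 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | | Unverified |
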
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified