SOTAVerified

Retrieval

A methodology that involves selecting relevant data or examples from a large dataset to support tasks like prediction, learning, or inference. It enhances models by providing context or additional information, often used in systems like retrieval-augmented generation or in-context learning.

Papers

Showing 62016250 of 14297 papers

TitleStatusHype
Progressive Spatio-Temporal Prototype Matching for Text-Video RetrievalCode1
Democratising 2D Sketch to 3D Shape Retrieval Through Pivoting0
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation0
Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pre-training0
Prototypical Mixing and Retrieval-Based Refinement for Label Noise-Resistant Image RetrievalCode0
Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video RetrievalCode1
Disentangled Representation Learning for Unsupervised Neural Quantization0
Deep Semi-Supervised Metric Learning With Mixed Label Propagation0
Teacher-Generated Spatial-Attention Labels Boost Robustness and Accuracy of Contrastive Models0
Multilateral Semantic Relations Modeling for Image Text Retrieval0
RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training0
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning NetworkCode1
SVGformer: Representation Learning for Continuous Vector Graphics Using Transformers0
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active LearningCode1
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval0
DLBD: A Self-Supervised Direct-Learned Binary DescriptorCode0
Open-Set Fine-Grained Retrieval via Prompting Vision-Language Evaluator0
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval0
Towards Modality-Agnostic Person Re-Identification With Descriptive QueryCode1
Deep Hashing With Minimal-Distance-Separated Hash Centers0
RONO: Robust Discriminative Learning With Noisy Labels for 2D-3D Cross-Modal RetrievalCode1
Learnable Skeleton-Aware 3D Point Cloud Sampling0
Modeling Video As Stochastic Processes for Fine-Grained Video Representation LearningCode1
Learning Attribute and Class-Specific Representation Duet for Fine-Grained Fashion Analysis0
M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout AnalysisCode1
CLIPPING: Distilling CLIP-Based Models With a Student Base for Video-Language Retrieval0
Revisiting Self-Similarity: Structural Embedding for Image RetrievalCode1
R2Former: Unified Retrieval and Reranking Transformer for Place RecognitionCode1
Learning Semantic Relationship Among Instances for Image-Text MatchingCode1
Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks0
GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks0
Rethinking with Retrieval: Faithful Large Language Model InferenceCode1
Rethinking Rotation Invariance with Point Cloud Registration0
Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?Code2
TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)0
HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D ImagesCode1
BagFormer: Better Cross-Modal Retrieval via bag-wise interaction0
Result Diversification in Search and Recommendation: A SurveyCode0
Maximizing Use-Case Specificity through Precision Model Tuning0
Customizing Knowledge Graph Embedding to Improve Clinical Study Recommendation0
Demonstrate-Search-Predict: Composing retrieval and language models for knowledge-intensive NLPCode7
TempCLR: Temporal Alignment Representation with Contrastive LearningCode1
Attribute-Guided Multi-Level Attention Network for Fine-Grained Fashion RetrievalCode0
MVTN: Learning Multi-View Transformations for 3D UnderstandingCode1
Noise-aware Learning from Web-crawled Image-Text Data for Image CaptioningCode1
Efficiently Enabling Block Semantics and Data Updates in DNA Storage0
Cache-Aided Multi-User Private Information Retrieval using PDAs0
On Cache-Aided Multi-User Private Information Retrieval with Small Caches0
Modeling Time-Series and Spatial Data for Recommendations and Other Applications0
Development of a Thermodynamics of Human Cognition and Human Culture0
Show:102550
← PrevPage 125 of 286Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second183.53Unverified
2ElasticsearchQueries per second21.8Unverified
3BM25-PTQueries per second6.49Unverified
4Rank-BM25Queries per second1.18Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second20.88Unverified
2ElasticsearchQueries per second7.11Unverified
3Rank-BM25Queries per second0.04Unverified
#ModelMetricClaimedVerifiedStatus
1BM25SQueries per second41.85Unverified
2ElasticsearchQueries per second12.16Unverified
3Rank-BM25Queries per second0.1Unverified
#ModelMetricClaimedVerifiedStatus
1FLMRRecall@589.32Unverified
2RA-VQARecall@582.84Unverified
#ModelMetricClaimedVerifiedStatus
1PreFLMRRecall@562.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP-KIStext-to-video Mean Rank30Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4OutfitRecall@57.59Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1MetaGen Blended RAGAccuracy (Top-1)82.1Unverified
#ModelMetricClaimedVerifiedStatus
1COLTCOMP@84.55Unverified
#ModelMetricClaimedVerifiedStatus
1hello0L1,121,222Unverified