SOTAVerified

Retrieval

Retrieval is the task of selecting relevant data or examples from a large dataset to support prediction, learning, or inference. It supplies a model with context or additional information, and is commonly used in retrieval-augmented generation and in-context learning.
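The selection step described above can be illustrated with a minimal sketch. It assumes a toy in-memory corpus, a naive whitespace tokenizer, and cosine similarity over raw term counts. The function names (`tokenize`, `cosine`, `retrieve`) and the sample corpus are hypothetical and chosen only for illustration; real systems typically use BM25 or learned embeddings instead.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, corpus, k=2):
    """Return the k corpus entries most similar to the query, best first."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(doc))), doc) for doc in corpus]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


# Hypothetical toy corpus, not taken from any benchmark listed on this page.
corpus = [
    "retrieval augmented image captioning",
    "nearest neighbor machine translation",
    "cross-modal retrieval with mismatched pairs",
]
top = retrieve("image retrieval", corpus, k=1)
print(top)
```

In a retrieval-augmented setup, the returned entries would then be passed to the downstream model as extra context; that step is omitted here.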

Papers

Showing 1501–1550 of 14297 papers

Title | Status | Hype
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID | Code | 1
Single-branch Network for Multimodal Training | Code | 1
FastFill: Efficient Compatible Model Update | Code | 1
Co-Attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-Identification | Code | 1
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval | Code | 1
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Code | 1
WiCE: Real-World Entailment for Claims in Wikipedia | Code | 1
BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs | Code | 1
RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training | Code | 1
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face | Code | 1
Pretraining De-Biased Language Model with Large-scale Click Logs for Document Ranking | Code | 1
ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics | Code | 1
Retrieved Sequence Augmentation for Protein Representation Learning | Code | 1
Semantic-Fused Multi-Granularity Cross-City Traffic Prediction | Code | 1
Teaching CLIP to Count to Ten | Code | 1
Simple and Scalable Nearest Neighbor Machine Translation | Code | 1
One-Shot Labeling for Automatic Relevance Estimation | Code | 1
Cross-Modal Retrieval with Partially Mismatched Pairs | Code | 1
Patent Image Retrieval Using Cross-entropy-based Metric Learning | Code | 1
Binary Embedding-based Retrieval at Tencent | Code | 1
Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts | Code | 1
jazznet: A Dataset of Fundamental Piano Patterns for Music Audio Machine Learning Research | Code | 1
Multimodal Federated Learning via Contrastive Representation Ensemble | Code | 1
Retrieval-augmented Image Captioning | Code | 1
Unsupervised Hashing with Similarity Distribution Calibration | Code | 1
UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling | Code | 1
Sketch Less Face Image Retrieval: A New Challenge | Code | 1
Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service Support | Code | 1
Pic2Word: Mapping Pictures to Words for Zero-shot Composed Image Retrieval | Code | 1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval | Code | 1
Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs | Code | 1
ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval | Code | 1
Simple, Effective and General: A New Backbone for Cross-view Image Geo-localization | Code | 1
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications | Code | 1
Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments | Code | 1
What Makes Good Examples for Visual In-Context Learning? | Code | 1
Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks | Code | 1
ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts | Code | 1
Efficiently predicting high resolution mass spectra with graph neural networks | Code | 1
Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring | Code | 1
GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Code | 1
ExaRanker: Explanation-Augmented Neural Ranker | Code | 1
Lexi: Self-Supervised Learning of the UI Language | Code | 1
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval | Code | 1
Learning Customized Visual Models with Retrieval-Augmented Knowledge | Code | 1
Modeling Uncertain Feature Representation for Domain Generalization | Code | 1
EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records | Code | 1
UATVR: Uncertainty-Adaptive Text-Video Retrieval | Code | 1
Do the Findings of Document and Passage Retrieval Generalize to the Retrieval of Responses for Dialogues? | Code | 1
Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | BM25S | Queries per second | 183.53 | - | Unverified
2 | Elasticsearch | Queries per second | 21.8 | - | Unverified
3 | BM25-PT | Queries per second | 6.49 | - | Unverified
4 | Rank-BM25 | Queries per second | 1.18 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BM25S | Queries per second | 20.88 | - | Unverified
2 | Elasticsearch | Queries per second | 7.11 | - | Unverified
3 | Rank-BM25 | Queries per second | 0.04 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BM25S | Queries per second | 41.85 | - | Unverified
2 | Elasticsearch | Queries per second | 12.16 | - | Unverified
3 | Rank-BM25 | Queries per second | 0.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | FLMR | Recall@5 | 89.32 | - | Unverified
2 | RA-VQA | Recall@5 | 82.84 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PreFLMR | Recall@5 | 62.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CLIP-KIS | text-to-video Mean Rank | 30 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CLIP4Outfit | Recall@5 | 7.59 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MetaGen Blended RAG | Accuracy (Top-1) | 82.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | COLT | COMP@8 | 4.55 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | hello | 0L | 1,121,222 | - | Unverified