SOTAVerified

Text Matching

Matching a target text to a source text based on their meaning.

Papers

Showing 76100 of 364 papers

TitleStatusHype
Learning Semantic Relationship Among Instances for Image-Text MatchingCode1
Declaration-based Prompt Tuning for Visual Question AnsweringCode1
BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity ConsistencyCode1
BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP for Generic Natural Visual Stimulus DecodingCode1
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text MatchingCode1
A Comparison of Supervised Learning to Match Methods for Product SearchCode1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision ModelsCode1
Improved Probabilistic Image-Text RepresentationsCode1
MedICaT: A Dataset of Medical Images, Captions, and Textual ReferencesCode1
Machine Reading Comprehension: The Role of Contextualized Language Models and BeyondCode1
More Grounded Image Captioning by Distilling Image-Text Matching ModelCode1
RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-trainingCode1
Adaptive Offline Quintuplet Loss for Image-Text MatchingCode1
DenseCLIP: Language-Guided Dense Prediction with Context-Aware PromptingCode1
Narrative Action Evaluation with Prompt-Guided Multimodal InteractionCode1
No Token Left Behind: Explainability-Aided Image Classification and GenerationCode1
DF-GAN: A Simple and Effective Baseline for Text-to-Image SynthesisCode1
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIPCode1
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge TransferCode1
A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalCode1
Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language LearnersCode1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object RepresentationCode1
Revisiting Deep Audio-Text Retrieval Through the Lens of TransportationCode1
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression SegmentationCode1
Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study0
Show:102550
← PrevPage 4 of 15Next →

No leaderboard results yet.