SOTAVerified

Text Matching

Matching a target text to a source text based on their meaning.

Papers

Showing 2650 of 364 papers

TitleStatusHype
Are Diffusion Models Vision-And-Language Reasoners?Code1
DF-GAN: A Simple and Effective Baseline for Text-to-Image SynthesisCode1
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word EmphasisCode1
Keyword-Attentive Deep Semantic MatchingCode1
ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCOCode1
Graph Structured Network for Image-Text MatchingCode1
Deep Multimodal Neural Architecture SearchCode1
ColorSwap: A Color and Word Order Dataset for Multimodal EvaluationCode1
Declaration-based Prompt Tuning for Visual Question AnsweringCode1
ComCLIP: Training-Free Compositional Image and Text MatchingCode1
Advancing Visual Grounding with Scene Knowledge: Benchmark and MethodCode1
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text MatchingCode1
BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity ConsistencyCode1
BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP for Generic Natural Visual Stimulus DecodingCode1
A Comparison of Supervised Learning to Match Methods for Product SearchCode1
Consensus-Aware Visual-Semantic Embedding for Image-Text MatchingCode1
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial NetworksCode1
Adaptive Offline Quintuplet Loss for Image-Text MatchingCode1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision ModelsCode1
Cross-modal Active Complementary Learning with Self-refining CorrespondenceCode1
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIPCode1
A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalCode1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object RepresentationCode1
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression SegmentationCode1
DenseCLIP: Language-Guided Dense Prediction with Context-Aware PromptingCode1
Show:102550
← PrevPage 2 of 15Next →

No leaderboard results yet.