SOTAVerified

Text Matching

Matching a target text to a source text based on their meaning.

Papers

Showing 2650 of 364 papers

TitleStatusHype
Are Diffusion Models Vision-And-Language Reasoners?Code1
Consensus-Aware Visual-Semantic Embedding for Image-Text MatchingCode1
Graph Structured Network for Image-Text MatchingCode1
Keyword-Attentive Deep Semantic MatchingCode1
Identifying Machine-Paraphrased PlagiarismCode1
Extractive Summarization as Text MatchingCode1
Composing Object Relations and Attributes for Image-Text MatchingCode1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning NetworkCode1
Advancing Visual Grounding with Scene Knowledge: Benchmark and MethodCode1
ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCOCode1
ComCLIP: Training-Free Compositional Image and Text MatchingCode1
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial NetworksCode1
BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity ConsistencyCode1
BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP for Generic Natural Visual Stimulus DecodingCode1
A Comparison of Supervised Learning to Match Methods for Product SearchCode1
ColorSwap: A Color and Word Order Dataset for Multimodal EvaluationCode1
Cross-modal Active Complementary Learning with Self-refining CorrespondenceCode1
Adaptive Offline Quintuplet Loss for Image-Text MatchingCode1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision ModelsCode1
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text MatchingCode1
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIPCode1
A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalCode1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object RepresentationCode1
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression SegmentationCode1
Fine-grained Video-Text Retrieval with Hierarchical Graph ReasoningCode1
Show:102550
← PrevPage 2 of 15Next →

No leaderboard results yet.