SOTAVerified

Text Matching

Matching a target text to a source text based on their meaning.

Papers

Showing 151–175 of 364 papers

| Title | Status | Hype |
|---|---|---|
| Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching | | 0 |
| BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP for Generic Natural Visual Stimulus Decoding | Code | 1 |
| Co-Driven Recognition of Semantic Consistency via the Fusion of Transformer and HowNet Sememes Knowledge | Code | 0 |
| Unified Vision-Language Representation Modeling for E-Commerce Same-Style Products Retrieval | | 0 |
| Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval | | 0 |
| Improving Zero-Shot Action Recognition using Human Instruction with Text Description | | 0 |
| VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching | | 0 |
| Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency | | 0 |
| ShapeScaffolder: Structure-Aware 3D Shape Generation from Text | | 0 |
| Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Code | 1 |
| Learning Semantic Relationship Among Instances for Image-Text Matching | Code | 1 |
| RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension | | 0 |
| Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection | | 0 |
| Uniform Masking Prevails in Vision-Language Pretraining | | 0 |
| SimVTP: Simple Video Text Pre-training with Masked Autoencoders | Code | 0 |
| ComCLIP: Training-Free Compositional Image and Text Matching | Code | 1 |
| Self-supervised vision-language pretraining for Medical visual question answering | Code | 1 |
| DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting | Code | 2 |
| Zero-Shot Text Matching for Automated Auditing using Sentence Transformers | | 0 |
| UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance | | 0 |
| Dissecting Deep Metric Learning Losses for Image-Text Retrieval | Code | 0 |
| Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies? | Code | 0 |
| Law Article-Enhanced Legal Case Matching: a Causal Learning Approach | Code | 0 |
| MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Code | 1 |
| Using Interventions to Improve Out-of-Distribution Generalization of Text-Matching Recommendation Systems | | 0 |

No leaderboard results yet.