
Text Matching

Matching a target text to a source text based on their meaning.

Papers

Showing 1-50 of 364 papers

Title | Status | Hype
ColPali: Efficient Document Retrieval with Vision Language Models | Code | 7
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval | Code | 2
Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text Matching | Code | 2
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment | Code | 2
Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval | Code | 2
MouSi: Poly-Visual-Expert Vision-Language Models | Code | 2
LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment | Code | 2
FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Code | 2
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models | Code | 2
Language Models Can See: Plugging Visual Controls in Text Generation | Code | 2
DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting | Code | 2
ActionCLIP: A New Paradigm for Video Action Recognition | Code | 1
KETM: A Knowledge-Enhanced Text Matching method | Code | 1
Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering | Code | 1
Image-text matching for large-scale book collections | Code | 1
Identifying Machine-Paraphrased Plagiarism | Code | 1
Improved Probabilistic Image-Text Representations | Code | 1
Lattice CNNs for Matching Based Chinese Question Answering | Code | 1
Extractive Summarization as Text Matching | Code | 1
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning | Code | 1
Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners | Code | 1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Code | 1
Composing Object Relations and Attributes for Image-Text Matching | Code | 1
HANet: Hierarchical Alignment Networks for Video-Text Retrieval | Code | 1
A Dense Representation Framework for Lexical and Semantic Matching | Code | 1
Are Diffusion Models Vision-And-Language Reasoners? | Code | 1
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis | Code | 1
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis | Code | 1
Keyword-Attentive Deep Semantic Matching | Code | 1
ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO | Code | 1
Graph Structured Network for Image-Text Matching | Code | 1
Deep Multimodal Neural Architecture Search | Code | 1
ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation | Code | 1
Declaration-based Prompt Tuning for Visual Question Answering | Code | 1
ComCLIP: Training-Free Compositional Image and Text Matching | Code | 1
Advancing Visual Grounding with Scene Knowledge: Benchmark and Method | Code | 1
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching | Code | 1
BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity Consistency | Code | 1
BrainCLIP: Bridging Brain and Visual-Linguistic Representation Via CLIP for Generic Natural Visual Stimulus Decoding | Code | 1
A Comparison of Supervised Learning to Match Methods for Product Search | Code | 1
Consensus-Aware Visual-Semantic Embedding for Image-Text Matching | Code | 1
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks | Code | 1
Adaptive Offline Quintuplet Loss for Image-Text Matching | Code | 1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Code | 1
Cross-modal Active Complementary Learning with Self-refining Correspondence | Code | 1
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP | Code | 1
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval | Code | 1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Code | 1
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation | Code | 1
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting | Code | 1

No leaderboard results yet.