SOTAVerified

Text Matching

Matching a target text to a source text based on their meaning.

Papers

Showing 101150 of 364 papers

TitleStatusHype
Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching0
Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking0
Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary TasksCode0
Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise FilteringCode0
GLS-CSC: A Simple but Effective Strategy to Mitigate Chinese STM Models' Over-Reliance on Superficial Clue0
Prompt-based Effective Input Reformulation for Legal Case RetrievalCode0
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression SegmentationCode1
ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation0
Text Matching Improves Sequential Recommendation by Reducing Popularity BiasesCode1
Uniformly Distributed Category Prototype-Guided Vision-Language Framework for Long-Tail Recognition0
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE0
Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models0
KETM:A Knowledge-Enhanced Text Matching methodCode1
Improving Zero-Shot Text Matching for Financial Auditing with Large Language Models0
InfeRE: Step-by-Step Regex Generation via Chain of InferenceCode0
Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative EliminationCode1
3D-VisTA: Pre-trained Transformer for 3D Vision and Text AlignmentCode2
Grounded Image Text Matching with Mismatched Relation Reasoning0
A Systematic Survey of Prompt Engineering on Vision-Language Foundation ModelsCode2
Advancing Visual Grounding with Scene Knowledge: Benchmark and MethodCode1
UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language UnderstandingCode1
Improving Text Matching in E-Commerce Search with A Rationalizable, Intervenable and Fast Entity-Based Relevance Model0
Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search BenchmarkCode1
Revisiting the Role of Language Priors in Vision-Language ModelsCode1
Improved Probabilistic Image-Text RepresentationsCode1
Are Diffusion Models Vision-And-Language Reasoners?Code1
UniTRec: A Unified Text-to-Text Transformer and Joint Contrastive Learning Framework for Text-based RecommendationCode1
PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification0
Fusion-in-T5: Unifying Document Ranking Signals for Improved Information RetrievalCode0
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis EvaluationCode1
MALM: Mask Augmentation based Local Matching for Food-Recipe RetrievalCode0
Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language LearnersCode1
Probing the Role of Positional Information in Vision-Language Models0
Scene Text Recognition with Image-Text Matching-guided Dictionary0
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured RepresentationsCode1
Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss InformationCode0
RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching ModelsCode0
Integrity and Junkiness Failure Handling for Embedding-based Retrieval: A Case Study in Social Network Search0
Verbs in Action: Improving verb understanding in video-language modelsCode0
The Short Text Matching Model Enhanced with Knowledge via Contrastive Learning0
Multi-Modal Representation Learning with Text-Driven Soft Masks0
Probabilistic Prompt Learning for Dense Prediction0
A Measurement-Based Quantum-Like Language Model for Text Matching0
Multimodal Image-Text Matching Improves Retrieval-based Chest X-Ray Report GenerationCode1
Integrating Language Guidance Into Image-Text Matching for Correcting False NegativesCode0
Plug-and-Play Regulators for Image-Text MatchingCode1
Increasing Textual Context Size Boosts Medical Image-Text MatchingCode0
BiCro: Noisy Correspondence Rectification for Multi-modality Data via Bi-directional Cross-modal Similarity ConsistencyCode1
Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person RetrievalCode2
Refined Vision-Language Modeling for Fine-grained Multi-modal Pre-training0
Show:102550
← PrevPage 3 of 8Next →

No leaderboard results yet.