Compositional Image-Text Matching and Retrieval by Grounding Entities | May 4, 2025 | Image Captioning, Image-text matching | Code Available 05
Dissecting Deep Metric Learning Losses for Image-Text Retrieval | Oct 21, 2022 | Cross-Modal Retrieval, Image-text matching | Code Available 05
Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss Information | May 2, 2023 | Bayesian Inference, Image-text matching | Code Available 05
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation | Dec 10, 2021 | Image-text matching, Image-text Retrieval | Unverified 00
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations | Mar 2, 2025 | image-classification, Image Classification | Unverified 00
A Concept-Centric Approach to Multi-Modality Learning | Dec 18, 2024 | Image-text matching, Question Answering | Unverified 00
Active Mining Sample Pair Semantics for Image-text Matching | Nov 9, 2023 | Active Learning, Image-text matching | Unverified 00
AdsCVLR: Commercial Visual-Linguistic Representation Modeling in Sponsored Search | Oct 10, 2022 | Contrastive Learning, Image-text matching | Unverified 00
Advanced Multimodal Deep Learning Architecture for Image-Text Matching | Jun 13, 2024 | Deep Learning, Image-text matching | Unverified 00
A Novel Attention-based Aggregation Function to Combine Vision and Language | Apr 27, 2020 | General Classification, Image Captioning | Unverified 00
A Self-Boosting Framework for Automated Radiographic Report Generation | Jun 19, 2021 | Image Captioning, Image-text matching | Unverified 00
Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection | Nov 28, 2024 | Anomaly Detection, Image-text matching | Unverified 00
Breaking Through the Noisy Correspondence: A Robust Model for Image-Text Matching | Apr 29, 2024 | Cross-modal retrieval with noisy correspondence, Image-text matching | Unverified 00
Bridging the Modality Gap: Dimension Information Alignment and Sparse Spatial Constraint for Image-Text Matching | Oct 22, 2024 | Contrastive Learning, Image-text matching | Unverified 00
CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering | May 13, 2024 | Audio-visual Question Answering, Audio-Visual Question Answering (AVQA) | Unverified 00
Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models | Mar 29, 2024 | Image-text matching, Object Recognition | Unverified 00
A New Fine-grained Alignment Method for Image-text Matching | Nov 3, 2023 | Image-text matching, Image-text Retrieval | Unverified 00
Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval | May 28, 2017 | Cross-Modal Retrieval, Image Retrieval | Unverified 00
CILF-CIAE: CLIP-driven Image-Language Fusion for Correcting Inverse Age Estimation | Dec 4, 2023 | Age Estimation, Image-text matching | Unverified 00
DARE: Diverse Visual Question Answering with Robustness Evaluation | Sep 26, 2024 | image-classification, Image Classification | Unverified 00
DEMO: A Statistical Perspective for Efficient Image-Text Matching | May 19, 2024 | Image-text matching, Model Optimization | Unverified 00
Dependency Structure Augmented Contextual Scoping Framework for Multimodal Aspect-Based Sentiment Analysis | Apr 15, 2025 | Aspect-Based Sentiment Analysis, Dependency Parsing | Unverified 00
Descriptive Image-Text Matching with Graded Contextual Similarity | May 15, 2025 | Descriptive, Image-text matching | Unverified 00
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching | Apr 21, 2021 | Image-text matching, Text Matching | Unverified 00
Don't Stop Learning: Towards Continual Learning for the CLIP Model | Jul 19, 2022 | Continual Learning, Image-text matching | Unverified 00
DT2I: Dense Text-to-Image Generation from Region Descriptions | Apr 5, 2022 | Conditional Image Generation, Image Generation | Unverified 00
Dual Embodied-Symbolic Concept Representations for Deep Learning | Mar 1, 2022 | class-incremental learning, Class Incremental Learning | Unverified 00
Dynamic and Compressive Adaptation of Transformers From Images to Videos | Aug 13, 2024 | Image-text matching, Text Matching | Unverified 00
Embedding Arithmetic of Multimodal Queries for Image Retrieval | Dec 6, 2021 | Image Retrieval, Image-text matching | Unverified 00
EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning | Oct 23, 2024 | Contrastive Learning, Image-text matching | Unverified 00
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE | Aug 23, 2023 | Image-text matching, Image-text Retrieval | Unverified 00
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching | Feb 20, 2020 | Image-text matching, Object | Unverified 00
FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues | Mar 29, 2024 | Image-text matching, Language Modeling | Unverified 00
Grounded Image Text Matching with Mismatched Relation Reasoning | Aug 2, 2023 | Image-text matching, Relation | Unverified 00
Hashing based Efficient Inference for Image-Text Matching | Aug 1, 2021 | Image-text matching, Text Matching | Unverified 00
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching | Jun 5, 2024 | cross-modal alignment, Image-text matching | Unverified 00
ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data | Jan 22, 2020 | Image Retrieval, Image-text matching | Unverified 00
Image-Text Matching with Multi-View Attention | Feb 27, 2024 | Diversity, Image-text matching | Unverified 00
Instruction-augmented Multimodal Alignment for Image-Text and Element Matching | Apr 16, 2025 | Image Augmentation, Image Generation | Unverified 00
InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining | Mar 30, 2020 | Image Retrieval, Image-text matching | Unverified 00
Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching | Oct 6, 2021 | Image Captioning, Image-text matching | Unverified 00
Knowledge Aware Semantic Concept Expansion for Image-Text Matching | Aug 10, 2019 | Common Sense Reasoning, Content-Based Image Retrieval | Unverified 00
Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification | Oct 17, 2023 | Image Retrieval, Image-text matching | Unverified 00
Learning Textual Prompts for Open-World Semi-Supervised Learning | Jan 1, 2025 | Image-text matching, Open-World Semi-Supervised Learning | Unverified 00
Macroscopic Control of Text Generation for Image Captioning | Jan 20, 2021 | Diversity, Image Captioning | Unverified 00
MASS: Overcoming Language Bias in Image-Text Matching | Jan 20, 2025 | Image-text matching, Image-text Retrieval | Unverified 00
Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection | Dec 12, 2022 | Fake News Detection, Image-text matching | Unverified 00
More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching | Nov 16, 2021 | Contrastive Learning, Image-text matching | Unverified 00
More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching | May 20, 2021 | Contrastive Learning, Cross-Modal Retrieval | Unverified 00
Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching | Dec 26, 2024 | Image-text matching, Text Matching | Unverified 00