Dual Embodied-Symbolic Concept Representations for Deep Learning Mar 1, 2022 class-incremental learning Class Incremental Learning
— Unverified 0Dynamic and Compressive Adaptation of Transformers From Images to Videos Aug 13, 2024 Image-text matching Text Matching
— Unverified 0Embedding Arithmetic of Multimodal Queries for Image Retrieval Dec 6, 2021 Image Retrieval Image-text matching
— Unverified 0EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning Oct 23, 2024 Contrastive Learning Image-text matching
— Unverified 0EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE Aug 23, 2023 Image-text matching Image-text Retrieval
— Unverified 0Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching Feb 20, 2020 Image-text matching Object
— Unverified 0FSMR: A Feature Swapping Multi-modal Reasoning Approach with Joint Textual and Visual Clues Mar 29, 2024 Image-text matching Language Modeling
— Unverified 0Grounded Image Text Matching with Mismatched Relation Reasoning Aug 2, 2023 Image-text matching Relation
— Unverified 0Hashing based Efficient Inference for Image-Text Matching Aug 1, 2021 Image-text matching Text Matching
— Unverified 0Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching Jun 5, 2024 cross-modal alignment Image-text matching
— Unverified 0ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data Jan 22, 2020 Image Retrieval Image-text matching
— Unverified 0Image-Text Matching with Multi-View Attention Feb 27, 2024 Diversity Image-text matching
— Unverified 0Instruction-augmented Multimodal Alignment for Image-Text and Element Matching Apr 16, 2025 Image Augmentation Image Generation
— Unverified 0InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining Mar 30, 2020 Image Retrieval Image-text matching
— Unverified 0Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching Oct 6, 2021 Image Captioning Image-text matching
— Unverified 0Knowledge Aware Semantic Concept Expansion for Image-Text Matching Aug 10, 2019 Common Sense Reasoning Content-Based Image Retrieval
— Unverified 0Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification Oct 17, 2023 Image Retrieval Image-text matching
— Unverified 0Learning Textual Prompts for Open-World Semi-Supervised Learning Jan 1, 2025 Image-text matching Open-World Semi-Supervised Learning
— Unverified 0Macroscopic Control of Text Generation for Image Captioning Jan 20, 2021 Diversity Image Captioning
— Unverified 0MASS: Overcoming Language Bias in Image-Text Matching Jan 20, 2025 Image-text matching Image-text Retrieval
— Unverified 0Multimodal Matching-aware Co-attention Networks with Mutual Knowledge Distillation for Fake News Detection Dec 12, 2022 Fake News Detection Image-text matching
— Unverified 0More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching Nov 16, 2021 Contrastive Learning Image-text matching
— Unverified 0More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching May 20, 2021 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching Dec 26, 2024 Image-text matching Text Matching
— Unverified 0Multi-Modal Representation Learning with Text-Driven Soft Masks Apr 3, 2023 Contrastive Learning Data Augmentation
— Unverified 0MURAL: Multimodal, Multitask Representations Across Languages Nov 1, 2021 Cross-Modal Retrieval Image-text matching
— Unverified 0MURAL: Multimodal, Multitask Retrieval Across Languages Sep 10, 2021 Cross-Modal Retrieval Image-text matching
— Unverified 0NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training Sep 15, 2024 Contrastive Learning cross-modal alignment
— Unverified 0Object-centric Binding in Contrastive Language-Image Pretraining Feb 19, 2025 Image-text matching Object
— Unverified 0OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization Dec 7, 2023 Adversarial Attack Data Augmentation
— Unverified 0ParNet: Position-aware Aggregated Relation Network for Image-Text matching Jun 17, 2019 Image-text matching Position
— Unverified 0Selectively Hard Negative Mining for Alleviating Gradient Vanishing in Image-Text Matching Mar 1, 2023 Image-text matching Text Matching
— Unverified 0Step-Wise Hierarchical Alignment Network for Image-Text Matching Jun 11, 2021 Image-text matching Text Matching
— Unverified 0SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining Apr 1, 2024 Contrastive Learning Image-text matching
— Unverified 0TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP May 24, 2025 Image Captioning Image Generation
— Unverified 0Towards Deconfounded Image-Text Matching with Causal Inference Aug 22, 2024 Causal Inference Image-text matching
— Unverified 0Towards Efficient Cross-Modal Visual Textual Retrieval using Transformer-Encoder Deep Features Jun 1, 2021 Cross-Modal Retrieval Image Retrieval
— Unverified 0Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models Aug 18, 2023 Image-text matching Object Localization
— Unverified 0Two-stream Hierarchical Similarity Reasoning for Image-text Matching Mar 10, 2022 Image-text matching Image to text
— Unverified 0UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training Apr 1, 2021 Image-text matching Image-text Retrieval
— Unverified 0UFO: A UniFied TransfOrmer for Vision-Language Representation Learning Nov 19, 2021 Image Captioning Image-text matching
— Unverified 0Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking Sep 15, 2023 Image-text matching Re-Ranking
— Unverified 0Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Apr 20, 2022 Cross-Modal Retrieval Image Retrieval
— Unverified 0Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training Aug 16, 2019 Image-text matching Image-text Retrieval
— Unverified 0Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators Sep 22, 2019 Image Captioning Image-text matching
— Unverified 0Uniformly Distributed Category Prototype-Guided Vision-Language Framework for Long-Tail Recognition Aug 24, 2023 Attribute Image-text matching
— Unverified 0Uniform Masking Prevails in Vision-Language Pretraining Dec 10, 2022 Image-text matching Language Modeling
— Unverified 0UNITER: Learning UNiversal Image-TExt Representations Sep 25, 2019 Image-text matching Image-text Retrieval
— Unverified 0Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching Jan 18, 2022 Image-text matching Referring Expression
— Unverified 0UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance Oct 28, 2022 Image Generation Image-text matching
— Unverified 0