Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval Apr 6, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples Mar 30, 2023 Cross-Modal Retrieval Retrieval
— Unverified 0MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks Mar 29, 2023 Cross-Modal Retrieval Decoder
Code Code Available 0MXM-CLR: A Unified Framework for Contrastive Learning of Multifold Cross-Modal Representations Mar 20, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 0Adversarial Modality Alignment Network for Cross-Modal Molecule Retrieval Mar 8, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 0Cross-modal Retrieval with Improved Graph Convolution Mar 7, 2023 Cross-Modal Retrieval Representation Learning
— Unverified 0Data leakage in cross-modal retrieval training: A case study Feb 23, 2023 Cross-Modal Retrieval Retrieval
— Unverified 0X-TRA: Improving Chest X-ray Tasks with Cross-Modal Retrieval Augmentation Feb 22, 2023 Cross-Modal Retrieval Retrieval
— Unverified 0VITR: Augmenting Vision Transformers with Relation-Focused Learning for Cross-Modal Information Retrieval Feb 13, 2023 Cross-Modal Information Retrieval Cross-Modal Retrieval
— Unverified 0Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval Jan 17, 2023 Clustering Cross-Modal Retrieval
— Unverified 0Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks Jan 12, 2023 Cross-Modal Retrieval Open-Ended Question Answering
Code Code Available 0Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study Jan 12, 2023 Cross-Modal Retrieval Object
Code Code Available 0Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images Jan 10, 2023 Autonomous Navigation Cross-Modal Retrieval
— Unverified 0NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings Jan 7, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification Jan 1, 2023 Cross-Modal Retrieval Person Re-Identification
— Unverified 0Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks Jan 1, 2023 Cross-Modal Retrieval Image Captioning
— Unverified 0BagFormer: Better Cross-Modal Retrieval via bag-wise interaction Dec 29, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Retrieval-based Disentangled Representation Learning with Natural Language Supervision Dec 15, 2022 Cross-Modal Retrieval Disentanglement
— Unverified 0Scale-Semantic Joint Decoupling Network for Image-text Retrieval in Remote Sensing Dec 12, 2022 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0Using Multiple Instance Learning to Build Multimodal Representations Dec 11, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0TimbreCLIP: Connecting Timbre to Text and Images Nov 21, 2022 Cross-Modal Retrieval Image Generation
— Unverified 0Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval Nov 7, 2022 Cross-Modal Retrieval Representation Learning
— Unverified 03D Shape Knowledge Graph for Cross-domain 3D Shape Retrieval Oct 27, 2022 3D Shape Retrieval Cross-Modal Retrieval
— Unverified 0FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning Oct 26, 2022 Cross-Modal Retrieval Decoder
— Unverified 0Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision Oct 24, 2022 cross-modal alignment Cross-Modal Retrieval
— Unverified 0Dissecting Deep Metric Learning Losses for Image-Text Retrieval Oct 21, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 0Cross-modal Search Method of Technology Video based on Adversarial Learning and Feature Fusion Oct 11, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training Sep 30, 2022 Computational Efficiency Contrastive Learning
Code Code Available 0Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval Sep 27, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Information-Theoretic Hashing for Zero-Shot Cross-Modal Retrieval Sep 26, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Deep Manifold Hashing: A Divide-and-Conquer Approach for Semi-Paired Unsupervised Cross-Modal Retrieval Sep 26, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0OmniVL:One Foundation Model for Image-Language and Video-Language Tasks Sep 15, 2022 Action Classification Action Recognition
— Unverified 0MuLan: A Joint Embedding of Music Audio and Natural Language Aug 26, 2022 Cross-Modal Retrieval Music Tagging
Code Code Available 0A Channel Mix Method for Fine-Grained Cross-Modal Retrieval Aug 26, 2022 Cross-Modal Retrieval Retrieval
Code Code Available 0Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks Aug 22, 2022 All Cross-Modal Retrieval
Code Code Available 0See What You See: Self-supervised Cross-modal Retrieval of Visual Stimuli from Brain Activity Aug 7, 2022 cross-modal alignment Cross-Modal Retrieval
— Unverified 0Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval Jul 29, 2022 Cross-Modal Retrieval Data Augmentation
— Unverified 0ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval Jul 29, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 0Adaptive Asymmetric Label-guided Hashing for Multimedia Search Jul 26, 2022 Cross-Modal Retrieval Quantization
— Unverified 0Intra-Modal Constraint Loss For Image-Text Retrieval Jul 11, 2022 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Contrastive Cross-Modal Knowledge Sharing Pre-training for Vision-Language Representation Learning and Retrieval Jul 2, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation Jun 26, 2022 Cross-Modal Retrieval Representation Learning
— Unverified 0Emphasizing Complementary Samples for Non-literal Cross-modal Retrieval Jun 25, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval May 24, 2022 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0Deep Supervised Information Bottleneck Hashing for Cross-modal Retrieval based Computer-aided Diagnosis May 6, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations Apr 20, 2022 Cross-Modal Retrieval Image Retrieval
— Unverified 0Unsupervised Contrastive Hashing for Cross-Modal Retrieval in Remote Sensing Apr 19, 2022 Binarization Cross-Modal Retrieval
— Unverified 0Learning Similarity Preserving Binary Codes for Recommender Systems Apr 18, 2022 Binarization Cross-Modal Retrieval
— Unverified 0COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval Apr 15, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval Mar 31, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 0