EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval | Jan 1, 2022 | Causal Inference, Contrastive Learning | Unverified 0
Cross Modal Retrieval with Querybank Normalisation | Dec 23, 2021 | Cross-Modal Retrieval, Metric Learning | Code Available 1
Fusion and Orthogonal Projection for Improved Face-Voice Association | Dec 20, 2021 | Cross-Modal Retrieval, Triplet | Code Available 1
CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising | Dec 14, 2021 | Cross-Modal Retrieval, Decoder | Unverified 0
Multi-Modal Mutual Information Maximization: A Novel Approach for Unsupervised Deep Cross-Modal Hashing | Dec 13, 2021 | Cross-Modal Retrieval, Retrieval | Unverified 0
Variational Autoencoder with CCA for Audio-Visual Cross-Modal Retrieval | Dec 5, 2021 | Cross-Modal Retrieval, Information Retrieval | Unverified 0
Learning with Noisy Correspondence for Cross-modal Matching | Dec 1, 2021 | Cross-Modal Retrieval, Cross-modal retrieval with noisy correspondence | Code Available 1
Emotion Embedding Spaces for Matching Music to Stories | Nov 26, 2021 | Cross-Modal Retrieval, Metric Learning | Code Available 1
Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action Classification, Action Recognition | Code Available 1
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts | Nov 16, 2021 | Cross-Modal Retrieval, Image Captioning | Code Available 1
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval | Nov 10, 2021 | Contrastive Learning, Cross-Modal Retrieval | Unverified 0
The Curious Layperson: Fine-Grained Image Recognition without Expert Labels | Nov 5, 2021 | Cross-Modal Retrieval, Fine-Grained Image Recognition | Code Available 1
An Empirical Study of Training End-to-End Vision-and-Language Transformers | Nov 3, 2021 | Cross-Modal Retrieval, Decoder | Code Available 1
Inflate and Shrink: Enriching and Reducing Interactions for Fast Text-Image Retrieval | Nov 1, 2021 | Cross-Modal Retrieval, Image Retrieval | Unverified 0
Text2Mol: Cross-Modal Molecule Retrieval with Natural Language Queries | Nov 1, 2021 | Cross-Modal Retrieval, Natural Language Queries | Code Available 1
MURAL: Multimodal, Multitask Representations Across Languages | Nov 1, 2021 | Cross-Modal Retrieval, Image-text matching | Unverified 0
BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval | Oct 29, 2021 | Cross-Modal Retrieval, Relation | Code Available 1
Learning Text-Image Joint Embedding for Efficient Cross-Modal Retrieval with Deep Feature Engineering | Oct 22, 2021 | Cross-Modal Retrieval, Feature Engineering | Code Available 0
Wav2CLIP: Learning Robust Audio Representations From CLIP | Oct 21, 2021 | Cross-Modal Retrieval, Image Generation | Code Available 1
Text-Based Person Search with Limited Data | Oct 20, 2021 | Benchmarking, Contrastive Learning | Code Available 1
VLDeformer: Vision-Language Decomposed Transformer for Fast Cross-Modal Retrieval | Oct 20, 2021 | Contrastive Learning, Cross-Modal Retrieval | Unverified 0
Learning Structural Representations for Recipe Generation and Food Retrieval | Oct 4, 2021 | Cross-Modal Retrieval, Image Captioning | Unverified 0
Self-Supervised Modality-Invariant and Modality-Specific Feature Learning for 3D Objects | Sep 29, 2021 | 3D Object Recognition, Cross-Modal Retrieval | Unverified 0
Calibrating Probabilistic Embeddings for Cross-Modal Retrieval | Sep 29, 2021 | Cross-Modal Retrieval, Retrieval | Unverified 0
MURAL: Multimodal, Multitask Retrieval Across Languages | Sep 10, 2021 | Cross-Modal Retrieval, Image-text matching | Unverified 0
EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling | Sep 10, 2021 | Cross-Modal Retrieval, Language Modeling | Unverified 0
X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics | Aug 18, 2021 | Cross-Modal Retrieval, Decoder | Code Available 1
Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images | Aug 9, 2021 | cross-modal alignment, Cross-Modal Retrieval | Unverified 0
Adaptive label-aware graph convolutional networks for cross-modal retrieval | Aug 6, 2021 | Cross-Modal Retrieval, Representation Learning | Code Available 1
Learning TFIDF Enhanced Joint Embedding for Recipe-Image Cross-Modal Retrieval Service | Aug 2, 2021 | Cross-Modal Retrieval, Retrieval | Code Available 0
Self-supervised Audiovisual Representation Learning for Remote Sensing Data | Aug 2, 2021 | Cross-Modal Retrieval, Representation Learning | Code Available 1
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation | Jul 16, 2021 | Cross-Modal Retrieval, Grounded language learning | Code Available 1
Dynamic Modality Interaction Modeling for Image-Text Retrieval | Jul 11, 2021 | cross-modal alignment, Cross-Modal Retrieval | Code Available 1
Evaluation of Audio-Visual Alignments in Visually Grounded Speech Models | Jul 5, 2021 | Cross-Modal Retrieval, Object Localization | Code Available 0
FedCMR: Federated Cross-Modal Retrieval | Jul 1, 2021 | Cross-Modal Retrieval, Federated Learning | Code Available 1
OPT: Omni-Perception Pre-Trainer for Cross-Modal Understanding and Generation | Jul 1, 2021 | Audio to Text Retrieval, Cross-Modal Retrieval | Code Available 0
Graph Pattern Loss based Diversified Attention Network for Cross-Modal Retrieval | Jun 25, 2021 | Cross-Modal Retrieval, Retrieval | Unverified 0
Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval | Jun 22, 2021 | Cross-Modal Retrieval, Diversity | Code Available 1
Learning Cross-Modal Retrieval With Noisy Labels | Jun 19, 2021 | Cross-Modal Retrieval, Retrieval | Code Available 1
Cross-Modal Center Loss for 3D Cross-Modal Retrieval | Jun 19, 2021 | Cross-Modal Retrieval, Retrieval | Unverified 0
Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval | Jun 19, 2021 | Cross-Modal Retrieval, Graph Matching | Unverified 0
Cross-Modal Discrete Representation Learning | Jun 10, 2021 | Cross-Modal Retrieval, Quantization | Unverified 0
Exploring modality-agnostic representations for music classification | Jun 2, 2021 | Classification, Cross-Modal Retrieval | Code Available 0
Cross-lingual Cross-modal Pretraining for Multimodal Retrieval | Jun 1, 2021 | Cross-Modal Retrieval, Machine Translation | Unverified 0
Towards Efficient Cross-Modal Visual Textual Retrieval using Transformer-Encoder Deep Features | Jun 1, 2021 | Cross-Modal Retrieval, Image Retrieval | Unverified 0
Learning Relation Alignment for Calibrated Cross-modal Retrieval | May 28, 2021 | Cross-Modal Retrieval, Image-text Retrieval | Code Available 1
More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching | May 20, 2021 | Contrastive Learning, Cross-Modal Retrieval | Unverified 0
Dual adversarial graph neural networks for multi-label cross-modal retrieval | May 18, 2021 | Cross-Modal Retrieval, Retrieval | Code Available 1
Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching | May 18, 2021 | Caption Generation, Cross-Modal Retrieval | Unverified 0
FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval | May 15, 2021 | Cross-Modal Retrieval, Quantization | Code Available 0