Learning Soft-Attention Models for Tempo-invariant Audio-Sheet Music Retrieval Jun 26, 2019 Cross-Modal Retrieval Retrieval
— Unverified 0Learning Sparse Disentangled Representations for Multimodal Exclusion Retrieval Apr 4, 2025 Cross-Modal Retrieval Disentanglement
— Unverified 0Learning Structural Representations for Recipe Generation and Food Retrieval Oct 4, 2021 Cross-Modal Retrieval Image Captioning
— Unverified 0Learning Visual-Semantic Embeddings for Reporting Abnormal Findings on Chest X-rays Oct 6, 2020 Clustering Cross-Modal Retrieval
— Unverified 0New Ideas and Trends in Deep Multimodal Content Understanding: A Review Oct 16, 2020 Cross-Modal Retrieval Deep Learning
— Unverified 0Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation Dec 27, 2023 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
— Unverified 0Objects that Sound Dec 18, 2017 Cross-Modal Retrieval Optical Flow Estimation
— Unverified 0OMGM: Orchestrate Multiple Granularities and Modalities for Efficient Multimodal Retrieval May 10, 2025 Cross-Modal Retrieval Question Answering
— Unverified 0OmniVL:One Foundation Model for Image-Language and Video-Language Tasks Sep 15, 2022 Action Classification Action Recognition
— Unverified 0Online Asymmetric Similarity Learning for Cross-Modal Retrieval Jul 1, 2017 Cross-Modal Retrieval Retrieval
— Unverified 0On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation Feb 26, 2025 Cross-Modal Retrieval Hallucination
— Unverified 0Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval Jul 29, 2022 Cross-Modal Retrieval Data Augmentation
— Unverified 0Pairwise Relationship Guided Deep Hashing for Cross-Modal Retrieval Feb 12, 2017 Cross-Modal Retrieval Deep Hashing
— Unverified 0PATFinger: Prompt-Adapted Transferable Fingerprinting against Unauthorized Multimodal Dataset Usage Apr 15, 2025 Cross-Modal Retrieval Retrieval
— Unverified 0Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions Feb 26, 2025 Cross-Modal Retrieval Language Modeling
— Unverified 0Perfect match: Improved cross-modal embeddings for audio-visual synchronisation Sep 21, 2018 Binary Classification Cross-Modal Retrieval
— Unverified 0PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting Jul 14, 2023 Cross-Modal Retrieval Image to text
— Unverified 0Pix2Map: Cross-modal Retrieval for Inferring Street Maps from Images Jan 10, 2023 Autonomous Navigation Cross-Modal Retrieval
— Unverified 0Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval Jul 16, 2020 Articles Cross-Modal Retrieval
— Unverified 0Progressive Domain-Independent Feature Decomposition Network for Zero-Shot Sketch-Based Image Retrieval Mar 22, 2020 Cross-Modal Retrieval Image Retrieval
— Unverified 0Ranking-based Deep Cross-modal Hashing May 11, 2019 Cross-Modal Retrieval Retrieval
— Unverified 0Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation Dec 14, 2024 Cross-Modal Retrieval Retrieval
— Unverified 0Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images Oct 14, 2018 Cross-Modal Retrieval General Classification
— Unverified 0Retrieval-based Disentangled Representation Learning with Natural Language Supervision Dec 15, 2022 Cross-Modal Retrieval Disentanglement
— Unverified 0Retrieving and Highlighting Action with Spatiotemporal Reference May 19, 2020 Action Recognition Cross-Modal Retrieval
— Unverified 0Revisiting Cross Modal Retrieval Jul 19, 2018 Cross-Modal Retrieval Retrieval
— Unverified 0Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation Jul 24, 2024 Avg Cross-Modal Retrieval
— Unverified 0RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval May 28, 2024 Cross-Modal Retrieval Retrieval
— Unverified 0Sample-Specific Debiasing for Better Image-Text Models Apr 25, 2023 Contrastive Learning Cross-Modal Retrieval
— Unverified 0SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking May 30, 2025 Cross-Modal Retrieval Person Retrieval
— Unverified 0Sat2Sound: A Unified Framework for Zero-Shot Soundscape Mapping May 19, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Scale-Semantic Joint Decoupling Network for Image-text Retrieval in Remote Sensing Dec 12, 2022 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0Second Place Solution of WSDM2023 Toloka Visual Question Answering Challenge Jul 5, 2024 Cross-Modal Retrieval Question Answering
— Unverified 0Seeing Speech and Sound: Distinguishing and Locating Audios in Visual Scenes Mar 24, 2025 Cross-Modal Retrieval Disentanglement
— Unverified 0Seeing Speech and Sound: Distinguishing and Locating Audio Sources in Visual Scenes Jan 1, 2025 Cross-Modal Retrieval Disentanglement
— Unverified 0See What You See: Self-supervised Cross-modal Retrieval of Visual Stimuli from Brain Activity Aug 7, 2022 cross-modal alignment Cross-Modal Retrieval
— Unverified 0Self-supervised Modal and View Invariant Feature Learning May 28, 2020 Cross-Modal Retrieval Retrieval
— Unverified 0Self-Supervised Modality-Invariant and Modality-Specific Feature Learning for 3D Objects Sep 29, 2021 3D Object Recognition Cross-Modal Retrieval
— Unverified 0Self-Supervised Visual Representations for Cross-Modal Retrieval Jan 31, 2019 Articles Cross-Modal Retrieval
— Unverified 0Semantic Adversarial Network for Zero-Shot Sketch-Based Image Retrieval May 7, 2019 Cross-Modal Retrieval Image Retrieval
— Unverified 0Semantic Compositions Enhance Vision-Language Contrastive Learning Jul 1, 2024 Classification Contrastive Learning
— Unverified 0SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs Apr 17, 2025 Cross-Modal Retrieval Image Retrieval
— Unverified 0Simple to Complex Cross-modal Learning to Rank Feb 4, 2017 Cross-Modal Retrieval Information Retrieval
— Unverified 0Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild Aug 27, 2024 Cross-Modal Retrieval Image Retrieval
— Unverified 0Sound Source Localization is All about Cross-Modal Alignment Sep 19, 2023 All cross-modal alignment
— Unverified 0Start from Video-Music Retrieval: An Inter-Intra Modal Loss for Cross Modal Retrieval Jul 28, 2024 Contrastive Learning Cross-Modal Retrieval
— Unverified 0SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval Nov 10, 2021 Contrastive Learning Cross-Modal Retrieval
— Unverified 0T3D: Advancing 3D Medical Vision-Language Pre-training by Learning Multi-View Visual Consistency Dec 3, 2023 Clinical Knowledge Contrastive Learning
— Unverified 0Task-adaptive Asymmetric Deep Cross-modal Hashing Apr 1, 2020 Cross-Modal Retrieval Retrieval
— Unverified 0Learning Joint Embedding for Cross-Modal Retrieval Aug 21, 2019 Cross-Modal Retrieval Retrieval
— Unverified 0