Towards Identity-Aware Cross-Modal Retrieval: a Dataset and a Baseline (Dec 30, 2024). Tags: Cross-Modal Retrieval, Face Swapping. Code available.
CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval (Dec 18, 2024). Tags: Cross-Modal Retrieval, Image Captioning. Unverified.
Dynamic Adapter with Semantics Disentangling for Cross-lingual Cross-modal Retrieval (Dec 18, 2024). Tags: Cross-Modal Retrieval, Retrieval. Code available.
Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation (Dec 14, 2024). Tags: Cross-Modal Retrieval, Retrieval. Unverified.
CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance (Dec 5, 2024). Tags: Contrastive Learning, cross-modal alignment. Unverified.
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey (Dec 3, 2024). Tags: Cross-Modal Retrieval, Natural Language Understanding. Unverified.
Fusing Physics-Driven Strategies and Cross-Modal Adversarial Learning: Toward Multi-Domain Applications (Nov 30, 2024). Tags: Cross-Modal Retrieval. Unverified.
FLEX-CLIP: Feature-Level GEneration Network Enhanced CLIP for X-shot Cross-modal Retrieval (Nov 26, 2024). Tags: Cross-Modal Retrieval, Retrieval. Unverified.
CLIPS: An Enhanced CLIP Framework for Learning with Synthetic Captions (Nov 25, 2024). Tags: Cross-Modal Retrieval. Unverified.
Improving Factuality of 3D Brain MRI Report Generation with Paired Image-domain Retrieval and Text-domain Augmentation (Nov 23, 2024). Tags: Cross-Modal Retrieval, Image to text. Unverified.
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity (Nov 23, 2024). Tags: Attribute, Cross-Modal Retrieval. Unverified.
Everything is a Video: Unifying Modalities through Next-Frame Prediction (Nov 15, 2024). Tags: Caption Generation, Cross-Modal Retrieval. Unverified.
Exploring Optimal Transport-Based Multi-Grained Alignments for Text-Molecule Retrieval (Nov 4, 2024). Tags: Contrastive Learning, Cross-Modal Retrieval. Unverified.
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs (Nov 4, 2024). Tags: Cross-Modal Retrieval, Information Retrieval. Unverified.
Towards Cross-Modal Text-Molecule Retrieval with Better Modality Alignment (Oct 31, 2024). Tags: Contrastive Learning, cross-modal alignment. Code available.
Multilingual Vision-Language Pre-training for the Remote Sensing Domain (Oct 30, 2024). Tags: Cross-Modal Retrieval, image-classification. Code available.
Test-time Adaptation for Cross-modal Retrieval with Query Shift (Oct 21, 2024). Tags: Cross-Modal Retrieval, Diversity. Unverified.
Deep Class-guided Hashing for Multi-label Cross-modal Retrieval (Oct 20, 2024). Tags: Cross-Modal Retrieval, Deep Hashing. Code available.
GleanVec: Accelerating vector search with minimalist nonlinear dimensionality reduction (Oct 14, 2024). Tags: Cross-Modal Retrieval, Dimensionality Reduction. Unverified.
MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models (Oct 13, 2024). Tags: Cross-Modal Retrieval, Question Answering. Unverified.
CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features (Oct 10, 2024). Tags: Cross-Modal Retrieval, GPU. Unverified.
Multimodal LLM Enhanced Cross-lingual Cross-modal Retrieval (Sep 30, 2024). Tags: Cross-Modal Retrieval, Large Language Model. Code available.
Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild (Aug 27, 2024). Tags: Cross-Modal Retrieval, Image Retrieval. Unverified.
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition -- And Ways to Overcome Them (Aug 21, 2024). Tags: Activity Recognition, Cross-Modal Retrieval. Unverified.
Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design (Aug 21, 2024). Tags: Cross-Modal Retrieval, Information Retrieval. Unverified.
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach (Aug 14, 2024). Tags: Cross-Modal Retrieval, Language Modeling. Unverified.
Efficient and Versatile Robust Fine-Tuning of Zero-shot Models (Aug 11, 2024). Tags: Cross-Modal Retrieval, zero-shot-classification. Unverified.
Contrastive masked auto-encoders based self-supervised hashing for 2D image and 3D point cloud cross-modal retrieval (Aug 11, 2024). Tags: Contrastive Learning, Cross-Modal Retrieval. Unverified.
Disentangled Noisy Correspondence Learning (Aug 10, 2024). Tags: cross-modal alignment, Cross-Modal Retrieval. Unverified.
Start from Video-Music Retrieval: An Inter-Intra Modal Loss for Cross Modal Retrieval (Jul 28, 2024). Tags: Contrastive Learning, Cross-Modal Retrieval. Unverified.
Unified Lexical Representation for Interpretable Visual-Language Alignment (Jul 25, 2024). Tags: Cross-Modal Retrieval, Language Modelling. Code available.
DAC: 2D-3D Retrieval with Noisy Labels via Divide-and-Conquer Alignment and Correction (Jul 25, 2024). Tags: cross-modal alignment, Cross-Modal Retrieval. Code available.
Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation (Jul 24, 2024). Tags: Avg, Cross-Modal Retrieval. Unverified.
ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map (Jul 17, 2024). Tags: Cross-Modal Retrieval, Dimensionality Reduction. Code available.
Second Place Solution of WSDM2023 Toloka Visual Question Answering Challenge (Jul 5, 2024). Tags: Cross-Modal Retrieval, Question Answering. Unverified.
Semantic Compositions Enhance Vision-Language Contrastive Learning (Jul 1, 2024). Tags: Classification, Contrastive Learning. Unverified.
Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning (Jun 26, 2024). Tags: Contrastive Learning, Cross-Modal Retrieval. Code available.
MATE: Meet At The Embedding -- Connecting Images with Long Texts (Jun 26, 2024). Tags: Cross-Modal Retrieval, Descriptive. Unverified.
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling (Jun 25, 2024). Tags: Cross-Modal Retrieval, Natural Language Queries. Unverified.
Deep Sketched Output Kernel Regression for Structured Prediction (Jun 13, 2024). Tags: Cross-Modal Retrieval, Prediction. Code available.
What If We Recaption Billions of Web Images with LLaMA-3? (Jun 12, 2024). Tags: Cross-Modal Retrieval, Image Generation. Unverified.
No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs (Jun 4, 2024). Tags: 3D Classification, Cross-Modal Retrieval. Unverified.
Multi-Modal Generative Embedding Model (May 29, 2024). Tags: Caption Generation, Cross-Modal Retrieval. Unverified.
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval (May 28, 2024). Tags: Cross-Modal Retrieval, Retrieval. Unverified.
Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models (May 23, 2024). Tags: Cross-Modal Retrieval, Representation Learning. Unverified.
Distilling Vision-Language Pretraining for Efficient Cross-Modal Retrieval (May 23, 2024). Tags: Cross-Modal Retrieval, Quantization. Unverified.
MVBIND: Self-Supervised Music Recommendation For Videos Via Embedding Space Binding (May 15, 2024). Tags: Cross-Modal Retrieval, Music Recommendation. Unverified.
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval (May 14, 2024). Tags: Cross-Modal Retrieval, Cross-Modal Retrieval on RSITMD. Unverified.
All in One Framework for Multimodal Re-identification in the Wild (May 8, 2024). Tags: All, Cross-Modal Retrieval. Unverified.
COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval (May 7, 2024). Tags: Cross-Modal Retrieval, Retrieval. Unverified.