Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection Jul 31, 2023 Adversarial Attack Information Retrieval
— Unverified 0Towards a Visual-Language Foundation Model for Computational Pathology Jul 24, 2023 Contrastive Learning image-classification
— Unverified 0Extracting Molecular Properties from Natural Language with Multimodal Contrastive Learning Jul 22, 2023 Contrastive Learning Property Prediction
— Unverified 0Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP Jul 18, 2023 Attribute Image-text Retrieval
— Unverified 0Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages Jun 29, 2023 Image-text Retrieval Machine Translation
Code Code Available 0Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input Jun 25, 2023 Diversity Image-text Retrieval
— Unverified 0TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter Jun 22, 2023 Question Answering Retrieval
Code Code Available 0MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian Jun 20, 2023 Cross-Lingual Transfer Retrieval
Code Code Available 0Align, Adapt and Inject: Sound-guided Unified Image Generation Jun 20, 2023 Image Generation Retrieval
— Unverified 0Integrating Listwise Ranking into Pairwise-based Image-Text Retrieval May 26, 2023 Image-text Retrieval Retrieval
Code Code Available 0Enhancing the Ranking Context of Dense Retrieval Methods through Reciprocal Nearest Neighbors May 25, 2023 Contrastive Learning Reranking
Code Code Available 0PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts May 24, 2023 Dialogue State Tracking Image Retrieval
Code Code Available 0When the Music Stops: Tip-of-the-Tongue Retrieval for Music May 23, 2023 Benchmarking Language Modeling
Code Code Available 0i-Code Studio: A Configurable and Composable Framework for Integrative AI May 23, 2023 Question Answering Retrieval
— Unverified 0VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending May 22, 2023 Question Answering Retrieval
— Unverified 0TOME: A Two-stage Approach for Model-based Retrieval May 18, 2023 Natural Questions Retrieval
— Unverified 0Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval May 13, 2023 Retrieval Text Retrieval
— Unverified 0Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception May 10, 2023 Classification image-classification
— Unverified 0Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining Apr 25, 2023 Articles Image-text Retrieval
— Unverified 0Is Cross-modal Information Retrieval Possible without Training? Apr 20, 2023 Contrastive Learning Cross-Modal Information Retrieval
— Unverified 0Automated Cardiovascular Record Retrieval by Multimodal Learning between Electrocardiogram and Clinical Report Apr 13, 2023 Diagnostic regression
— Unverified 0RECLIP: Resource-efficient CLIP by Training with Small Images Apr 12, 2023 Contrastive Learning Image-text Retrieval
— Unverified 0Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval Apr 6, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Free-Form Multi-Modal Multimedia Retrieval (4MR) Mar 29, 2023 Form Management
— Unverified 0Computationally Efficient Labeling of Cancer Related Forum Posts by Non-Clinical Text Information Retrieval Mar 24, 2023 Clustering Distributed Computing
— Unverified 0CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning Mar 22, 2023 Contrastive Learning Retrieval
— Unverified 0On-the-fly Text Retrieval for End-to-End ASR Adaptation Mar 20, 2023 Language Modeling Language Modelling
— Unverified 0Scene Graph Based Fusion Network For Image-Text Retrieval Mar 20, 2023 Image-text Retrieval Retrieval
— Unverified 0Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening Mar 14, 2023 Image-text Retrieval Multi-Label Classification
— Unverified 0Semantic-Preserving Augmentation for Robust Image-Text Retrieval Mar 10, 2023 Image-text Retrieval Retrieval
Code Code Available 0Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning Mar 10, 2023 Few-Shot Image Classification image-classification
— Unverified 0The style transformer with common knowledge optimization for image-text retrieval Mar 1, 2023 Image-text Retrieval Retrieval
— Unverified 0Deep Learning for Video-Text Retrieval: a Review Feb 24, 2023 Deep Learning Retrieval
— Unverified 0Video-Text Retrieval by Supervised Sparse Multi-Grained Learning Feb 19, 2023 Representation Learning Retrieval
Code Code Available 0Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis Feb 11, 2023 Image-text Retrieval Knowledge Graphs
Code Code Available 0STAIR: Learning Sparse Text and Image Representation in Grounded Tokens Jan 30, 2023 Information Retrieval Retrieval
— Unverified 0Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval Jan 30, 2023 Language Modeling Language Modelling
— Unverified 0USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval Jan 17, 2023 Contrastive Learning Image-text Retrieval
Code Code Available 0HADA: A Graph-based Amalgamation Framework in Image-text Retrieval Jan 11, 2023 Graph Neural Network Image Retrieval
Code Code Available 0NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings Jan 7, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0ViLEM: Visual-Language Error Modeling for Image-Text Retrieval Jan 1, 2023 Contrastive Learning Image-text Retrieval
— Unverified 0Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval Jan 1, 2023 Domain Adaptation Retrieval
— Unverified 0Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos Jan 1, 2023 Attribute Retrieval
— Unverified 0HiVLP: Hierarchical Interactive Video-Language Pre-Training Jan 1, 2023 Retrieval Self-Supervised Learning
— Unverified 0Multilateral Semantic Relations Modeling for Image Text Retrieval Jan 1, 2023 Image-text Retrieval Retrieval
— Unverified 0GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks Jan 1, 2023 Image Generation Image-text Retrieval
— Unverified 0VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching Jan 1, 2023 Image-text matching Image-text Retrieval
— Unverified 0When are Lemons Purple? The Concept Association Bias of Vision-Language Models Dec 22, 2022 Attribute image-classification
— Unverified 0Efficient Image Captioning for Edge Devices Dec 18, 2022 CPU Image Captioning
— Unverified 0AugTriever: Unsupervised Dense Retrieval and Domain Adaptation by Scalable Data Augmentation Dec 17, 2022 Data Augmentation Domain Adaptation
Code Code Available 0