Learnable Pillar-based Re-ranking for Image-Text Retrieval Apr 25, 2023 Image-text Retrieval Re-Ranking
Code Code Available 1Rethinking Benchmarks for Cross-modal Image-text Retrieval Apr 21, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1Image-text Retrieval via Preserving Main Semantics of Vision Apr 20, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1Is Cross-modal Information Retrieval Possible without Training? Apr 20, 2023 Contrastive Learning Cross-Modal Information Retrieval
— Unverified 0SViTT: Temporal Learning of Sparse Video-Text Transformers Apr 18, 2023 Question Answering Retrieval
Code Code Available 1Hyperbolic Image-Text Representations Apr 18, 2023 image-classification Image Classification
Code Code Available 1Automated Cardiovascular Record Retrieval by Multimodal Learning between Electrocardiogram and Clinical Report Apr 13, 2023 Diagnostic regression
— Unverified 0RECLIP: Resource-efficient CLIP by Training with Small Images Apr 12, 2023 Contrastive Learning Image-text Retrieval
— Unverified 0Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval Apr 6, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation Apr 4, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 3Free-Form Multi-Modal Multimedia Retrieval (4MR) Mar 29, 2023 Form Management
— Unverified 0Equivariant Similarity for Vision-Language Foundation Models Mar 25, 2023 Image-text Retrieval Retrieval
Code Code Available 1Computationally Efficient Labeling of Cancer Related Forum Posts by Non-Clinical Text Information Retrieval Mar 24, 2023 Clustering Distributed Computing
— Unverified 0CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning Mar 22, 2023 Contrastive Learning Retrieval
— Unverified 0On-the-fly Text Retrieval for End-to-End ASR Adaptation Mar 20, 2023 Language Modeling Language Modelling
— Unverified 0Scene Graph Based Fusion Network For Image-Text Retrieval Mar 20, 2023 Image-text Retrieval Retrieval
— Unverified 0Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening Mar 14, 2023 Image-text Retrieval Multi-Label Classification
— Unverified 0PMC-CLIP: Contrastive Language-Image Pre-training using Biomedical Documents Mar 13, 2023 image-classification Image Classification
Code Code Available 2Semantic-Preserving Augmentation for Robust Image-Text Retrieval Mar 10, 2023 Image-text Retrieval Retrieval
Code Code Available 0Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning Mar 10, 2023 Few-Shot Image Classification image-classification
— Unverified 0The style transformer with common knowledge optimization for image-text retrieval Mar 1, 2023 Image-text Retrieval Retrieval
— Unverified 0Deep Learning for Video-Text Retrieval: a Review Feb 24, 2023 Deep Learning Retrieval
— Unverified 0Cross-Modal Retrieval with Partially Mismatched Pairs Feb 22, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1Video-Text Retrieval by Supervised Sparse Multi-Grained Learning Feb 19, 2023 Representation Learning Retrieval
Code Code Available 0Multimodal Federated Learning via Contrastive Representation Ensemble Feb 17, 2023 Federated Learning Image-text Retrieval
Code Code Available 1UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling Feb 13, 2023 Image-text Retrieval Retrieval
Code Code Available 1Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis Feb 11, 2023 Image-text Retrieval Knowledge Graphs
Code Code Available 0LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval Feb 6, 2023 Image-text Retrieval Retrieval
Code Code Available 1UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Jan 31, 2023 Image Captioning Image Classification
Code Code Available 1Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval Jan 30, 2023 Language Modeling Language Modelling
— Unverified 0STAIR: Learning Sparse Text and Image Representation in Grounded Tokens Jan 30, 2023 Information Retrieval Retrieval
— Unverified 0Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring Jan 26, 2023 Representation Learning Retrieval
Code Code Available 1MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval Jan 19, 2023 Retrieval Text Retrieval
Code Code Available 1USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval Jan 17, 2023 Contrastive Learning Image-text Retrieval
Code Code Available 0HADA: A Graph-based Amalgamation Framework in Image-text Retrieval Jan 11, 2023 Graph Neural Network Image Retrieval
Code Code Available 0NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings Jan 7, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval Jan 1, 2023 image-classification Image Classification
Code Code Available 1VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching Jan 1, 2023 Image-text matching Image-text Retrieval
— Unverified 0HiVLP: Hierarchical Interactive Video-Language Pre-Training Jan 1, 2023 Retrieval Self-Supervised Learning
— Unverified 0Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos Jan 1, 2023 Attribute Retrieval
— Unverified 0Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval Jan 1, 2023 Domain Adaptation Retrieval
— Unverified 0Multilateral Semantic Relations Modeling for Image Text Retrieval Jan 1, 2023 Image-text Retrieval Retrieval
— Unverified 0Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network Jan 1, 2023 Image-text matching Retrieval
Code Code Available 1ViLEM: Visual-Language Error Modeling for Image-Text Retrieval Jan 1, 2023 Contrastive Learning Image-text Retrieval
— Unverified 0Learning Semantic Relationship Among Instances for Image-Text Matching Jan 1, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 1GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks Jan 1, 2023 Image Generation Image-text Retrieval
— Unverified 0When are Lemons Purple? The Concept Association Bias of Vision-Language Models Dec 22, 2022 Attribute image-classification
— Unverified 0Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing Dec 21, 2022 Contrastive Learning Drug Design
Code Code Available 2Efficient Image Captioning for Edge Devices Dec 18, 2022 CPU Image Captioning
— Unverified 0AugTriever: Unsupervised Dense Retrieval and Domain Adaptation by Scalable Data Augmentation Dec 17, 2022 Data Augmentation Domain Adaptation
Code Code Available 0