CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization Mar 31, 2025 Contrastive Learning image-classification
— Unverified 0LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text Mar 25, 2025 Cross-Modal Retrieval Hallucination
Code Code Available 1Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation Mar 20, 2025 Contrastive Learning Earth Observation
— Unverified 0Bayesian Test-Time Adaptation for Vision-Language Models Mar 12, 2025 image-classification Image Classification
— Unverified 0MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification Mar 10, 2025 Attribute image-classification
— Unverified 0MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations Mar 2, 2025 image-classification Image Classification
— Unverified 0Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Feb 6, 2025 image-classification Image Classification
Code Code Available 2KPL: Training-Free Medical Knowledge Mining of Vision-Language Models Jan 20, 2025 Classification image-classification
Code Code Available 0Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation Jan 1, 2025 image-classification Image Classification
— Unverified 0Post-hoc Probabilistic Vision-Language Models Dec 8, 2024 Active Learning Uncertainty Quantification
Code Code Available 1CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance Dec 5, 2024 Contrastive Learning cross-modal alignment
— Unverified 0TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Nov 4, 2024 Diversity image-classification
— Unverified 0TaxaBind: A Unified Embedding Space for Ecological Applications Nov 1, 2024 Audio Classification Cross-Modal Retrieval
Code Code Available 1Retrieval-enriched zero-shot image classification in low-resource domains Nov 1, 2024 image-classification Image Classification
— Unverified 0Multilingual Vision-Language Pre-training for the Remote Sensing Domain Oct 30, 2024 Cross-Modal Retrieval image-classification
Code Code Available 0Altogether: Image Captioning via Re-aligning Alt-text Oct 22, 2024 Image Captioning image-classification
Code Code Available 0Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Oct 20, 2024 Few-Shot Object Detection image-classification
Code Code Available 0Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual Knowledge Oct 16, 2024 Classification image-classification
Code Code Available 1CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features Oct 10, 2024 Cross-Modal Retrieval GPU
— Unverified 0LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model Oct 3, 2024 image-classification Image Classification
— Unverified 0CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling Sep 28, 2024 image-classification Image Classification
Code Code Available 2DPA: Dual Prototypes Alignment for Unsupervised Adaptation of Vision-Language Models Aug 16, 2024 Domain Adaptation image-classification
Code Code Available 0Do Vision-Language Foundational models show Robust Visual Perception? Aug 13, 2024 image-classification Image Classification
Code Code Available 0CoAPT: Context Attribute words for Prompt Tuning Jul 18, 2024 Attribute Descriptive
— Unverified 0Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion Jul 15, 2024 image-classification Image Classification
Code Code Available 0Semantic Compositions Enhance Vision-Language Contrastive Learning Jul 1, 2024 Classification Contrastive Learning
— Unverified 0PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration Jun 28, 2024 image-classification Image Classification
Code Code Available 2Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP Jun 25, 2024 cross-modal alignment Image Classification
Code Code Available 2WATT: Weight Average Test-Time Adaptation of CLIP Jun 19, 2024 image-classification Image Classification
Code Code Available 2BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models Jun 17, 2024 image-classification Image Classification
— Unverified 0Mind's Eye: Image Recognition by EEG via Multimodal Similarity-Keeping Contrastive Learning Jun 5, 2024 Contrastive Learning EEG
Code Code Available 1Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships May 29, 2024 Adversarial Defense Adversarial Robustness
— Unverified 0It's Not a Modality Gap: Characterizing and Addressing the Contrastive Gap May 28, 2024 image-classification Image Classification
— Unverified 0What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models May 24, 2024 Classification image-classification
Code Code Available 0Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp May 13, 2024 image-classification Image Classification
Code Code Available 0Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification May 3, 2024 image-classification Image Classification
— Unverified 0MoDE: CLIP Data Experts via Clustering Apr 24, 2024 Clustering image-classification
Code Code Available 0A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene Apr 17, 2024 image-classification Image Classification
— Unverified 0Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning Apr 4, 2024 Contrastive Learning image-classification
Code Code Available 1Learn "No" to Say "Yes" Better: Improving Vision-Language Models via Negations Mar 29, 2024 image-classification Image Classification
Code Code Available 1Bridge the Modality and Capability Gaps in Vision-Language Model Selection Mar 20, 2024 Capacity Estimation image-classification
— Unverified 0Can We Talk Models Into Seeing the World Differently? Mar 14, 2024 Image Captioning Image Classification
Code Code Available 1PromptKD: Unsupervised Prompt Distillation for Vision-Language Models Mar 5, 2024 Knowledge Distillation Prompt Engineering
Code Code Available 3Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning Feb 6, 2024 Few-Shot Learning image-classification
— Unverified 0Image-Caption Encoding for Improving Zero-Shot Generalization Feb 5, 2024 image-classification Image Classification
Code Code Available 0Segment Any Change Feb 2, 2024 Change Detection image-classification
Code Code Available 0CLAMP: Contrastive LAnguage Model Prompt-tuning Dec 4, 2023 Contrastive Learning Image Captioning
— Unverified 0LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models Dec 1, 2023 image-classification Image Classification
— Unverified 0Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models Nov 27, 2023 General Knowledge image-classification
Code Code Available 0Efficient Model-Agnostic Multi-Group Equivariant Networks Oct 14, 2023 Fairness image-classification
— Unverified 0