KPL: Training-Free Medical Knowledge Mining of Vision-Language Models Jan 20, 2025 Classification image-classification
Code Code Available 05 Text-to-Image Diffusion Models are Zero-Shot Classifiers Mar 27, 2023 Attribute Contrastive Learning
Code Code Available 05 Segment Any Change Feb 2, 2024 Change Detection image-classification
Code Code Available 05 Unconstrained Open Vocabulary Image Classification: Zero-Shot Transfer from Text to Image via CLIP Inversion Jul 15, 2024 image-classification Image Classification
Code Code Available 05 Altogether: Image Captioning via Re-aligning Alt-text Oct 22, 2024 Image Captioning image-classification
Code Code Available 05 Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models Nov 27, 2023 General Knowledge image-classification
Code Code Available 05 PaLI: A Jointly-Scaled Multilingual Language-Image Model Sep 14, 2022 Decoder Few-Shot Image Classification
Code Code Available 05 Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability Oct 20, 2024 Few-Shot Object Detection image-classification
Code Code Available 05 Multilingual Vision-Language Pre-training for the Remote Sensing Domain Oct 30, 2024 Cross-Modal Retrieval image-classification
Code Code Available 05 What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models May 24, 2024 Classification image-classification
Code Code Available 05 Learning from Children: Improving Image-Caption Pretraining via Curriculum May 27, 2023 image-classification Image Classification
Code Code Available 05 Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp May 13, 2024 image-classification Image Classification
Code Code Available 05 Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark Feb 14, 2022 Benchmarking Contrastive Learning
Code Code Available 05 MoDE: CLIP Data Experts via Clustering Apr 24, 2024 Clustering image-classification
Code Code Available 05 Semantically-Prompted Language Models Improve Visual Descriptions Jun 5, 2023 Classification Descriptive
— Unverified 00 MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations Mar 2, 2025 image-classification Image Classification
— Unverified 00 A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision Dec 27, 2021 Classification Image Captioning
— Unverified 00 A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene Apr 17, 2024 image-classification Image Classification
— Unverified 00 BaFTA: Backprop-Free Test-Time Adaptation For Zero-Shot Vision-Language Models Jun 17, 2024 image-classification Image Classification
— Unverified 00 Bayesian Test-Time Adaptation for Vision-Language Models Mar 12, 2025 image-classification Image Classification
— Unverified 00 Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation Mar 20, 2025 Contrastive Learning Earth Observation
— Unverified 00 Bridge the Modality and Capability Gaps in Vision-Language Model Selection Mar 20, 2024 Capacity Estimation image-classification
— Unverified 00 CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization Mar 31, 2025 Contrastive Learning image-classification
— Unverified 00 CLAMP: Contrastive LAnguage Model Prompt-tuning Dec 4, 2023 Contrastive Learning Image Captioning
— Unverified 00 Class Knowledge Overlay to Visual Feature Learning for Zero-Shot Image Classification Feb 26, 2021 General Classification image-classification
— Unverified 00 CLIP-PING: Boosting Lightweight Vision-Language Models with Proximus Intrinsic Neighbors Guidance Dec 5, 2024 Contrastive Learning cross-modal alignment
— Unverified 00 CoAPT: Context Attribute words for Prompt Tuning Jul 18, 2024 Attribute Descriptive
— Unverified 00 CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features Oct 10, 2024 Cross-Modal Retrieval GPU
— Unverified 00 DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning Dec 31, 2022 Active Learning Attribute
— Unverified 00 Efficient Model-Agnostic Multi-Group Equivariant Networks Oct 14, 2023 Fairness image-classification
— Unverified 00 Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss Oct 1, 2022 image-classification Image Classification
— Unverified 00 Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning Feb 6, 2024 Few-Shot Learning image-classification
— Unverified 00 Gaze Embeddings for Zero-Shot Image Classification Nov 28, 2016 Classification Fine-Grained Image Classification
— Unverified 00 Generative Negative Text Replay for Continual Vision-Language Pretraining Oct 31, 2022 Continual Learning image-classification
— Unverified 00 GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training Aug 22, 2023 image-classification Image Classification
— Unverified 00 I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification Sep 21, 2022 Generalized Zero-Shot Learning image-classification
— Unverified 00 I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification Dec 5, 2022 Classification image-classification
— Unverified 00 Improving Semantic Embedding Consistency by Metric Learning for Zero-Shot Classification Jul 27, 2016 Attribute General Classification
— Unverified 00 Integrating Propositional and Relational Label Side Information for Hierarchical Zero-Shot Image Classification Feb 14, 2019 Attribute General Classification
— Unverified 00 It's Not a Modality Gap: Characterizing and Addressing the Contrastive Gap May 28, 2024 image-classification Image Classification
— Unverified 00 Language to Network: Conditional Parameter Adaptation with Natural Language Descriptions Jul 1, 2020 General Classification image-classification
— Unverified 00 Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions Mar 17, 2021 Articles General Classification
— Unverified 00 Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships May 29, 2024 Adversarial Defense Adversarial Robustness
— Unverified 00 LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models Dec 1, 2023 image-classification Image Classification
— Unverified 00 LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model Oct 3, 2024 image-classification Image Classification
— Unverified 00 MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification Mar 10, 2025 Attribute image-classification
— Unverified 00 Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification May 3, 2024 image-classification Image Classification
— Unverified 00 Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models Sep 26, 2023 image-classification Image Classification
— Unverified 00 PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining Apr 29, 2022 Image Classification Language Modeling
— Unverified 00 RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training Jan 1, 2023 Classification image-classification
— Unverified 00