CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | Image Classification | Code Available 2
Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models | Sep 26, 2023 | Image Classification | Unverified 0
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training | Aug 22, 2023 | Image Classification | Unverified 0
PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts | Aug 2, 2023 | Classification, Image Classification | Code Available 1
PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization | Jul 27, 2023 | Domain Generalization, Image Classification | Code Available 1
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability | Jul 6, 2023 | Few-Shot Image Classification, Image Classification | Code Available 1
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Jun 19, 2023 | Classification, Cross-Modal Retrieval | Code Available 2
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding | Jun 15, 2023 | Contrastive Learning, Image Classification | Code Available 1
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations | Jun 14, 2023 | Image Classification | Code Available 1
Semantically-Prompted Language Models Improve Visual Descriptions | Jun 5, 2023 | Classification, Descriptive | Unverified 0
Learning from Children: Improving Image-Caption Pretraining via Curriculum | May 27, 2023 | Image Classification | Code Available 0
CamDiff: Camouflage Image Augmentation via Diffusion Model | Apr 11, 2023 | Dataset Generation, Image Augmentation | Code Available 1
Text-to-Image Diffusion Models are Zero-Shot Classifiers | Mar 27, 2023 | Attribute, Contrastive Learning | Code Available 0
Structure Pretraining and Prompt Tuning for Knowledge Graph Transfer | Mar 3, 2023 | Image Classification | Code Available 1
CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets | Feb 6, 2023 | Classification, Image Classification | Code Available 1
Language-Driven Anchors for Zero-Shot Adversarial Robustness | Jan 30, 2023 | Adversarial Defense, Adversarial Robustness | Code Available 0
Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities | Jan 26, 2023 | Image Classification | Unverified 0
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval | Jan 1, 2023 | Image Classification | Code Available 1
RA-CLIP: Retrieval Augmented Contrastive Language-Image Pre-Training | Jan 1, 2023 | Classification, Image Classification | Unverified 0
DiRaC-I: Identifying Diverse and Rare Training Classes for Zero-Shot Learning | Dec 31, 2022 | Active Learning, Attribute | Unverified 0
When are Lemons Purple? The Concept Association Bias of Vision-Language Models | Dec 22, 2022 | Attribute, Image Classification | Unverified 0
CLIPPO: Image-and-Language Understanding from Pixels Only | Dec 15, 2022 | Contrastive Learning, Image Classification | Code Available 0
Reproducible scaling laws for contrastive language-image learning | Dec 14, 2022 | Image Classification, Open Vocabulary Attribute Detection | Code Available 1
I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification | Dec 5, 2022 | Classification, Image Classification | Unverified 0
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities | Nov 12, 2022 | Contrastive Learning, Cross-Modal Retrieval | Code Available 4
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese | Nov 2, 2022 | Contrastive Learning, Image Classification | Code Available 5
Generative Negative Text Replay for Continual Vision-Language Pretraining | Oct 31, 2022 | Continual Learning, Image Classification | Unverified 0
Text2Model: Text-based Model Induction for Zero-shot Image Classification | Oct 27, 2022 | 3D Point Cloud Classification, Action Recognition | Unverified 0
General Image Descriptors for Open World Image Retrieval using ViT CLIP | Oct 20, 2022 | Image Retrieval, Retrieval | Code Available 1
Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss | Oct 1, 2022 | Image Classification | Unverified 0
I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification | Sep 21, 2022 | Generalized Zero-Shot Learning, Image Classification | Unverified 0
PaLI: A Jointly-Scaled Multilingual Language-Image Model | Sep 14, 2022 | Decoder, Few-Shot Image Classification | Code Available 0
What does a platypus look like? Generating customized prompts for zero-shot image classification | Sep 7, 2022 | Descriptive, Image Classification | Code Available 2
Zero-Shot Temporal Action Detection via Vision-Language Prompting | Jul 17, 2022 | Action Detection, Classification | Code Available 1
DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning | Jul 4, 2022 | Attribute, Contrastive Learning | Code Available 1
Disentangled Ontology Embedding for Zero-shot Learning | Jun 8, 2022 | Image Classification | Code Available 1
Masked Unsupervised Self-training for Label-free Image Classification | Jun 7, 2022 | Image Classification | Code Available 1
CCMB: A Large-scale Chinese Cross-modal Benchmark | May 8, 2022 | Image Classification | Code Available 1
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining | Apr 29, 2022 | Image Classification, Language Modeling | Unverified 0
Zero-Shot Logit Adjustment | Apr 25, 2022 | Bayesian Inference, Generalized Zero-Shot Learning | Code Available 1
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models | Apr 19, 2022 | Fairness, Few-Shot Image Classification | Code Available 4
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification | Mar 2, 2022 | Image Classification | Code Available 1
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark | Feb 14, 2022 | Benchmarking, Contrastive Learning | Code Available 0
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Dec 29, 2021 | Image Classification | Code Available 1
A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision | Dec 27, 2021 | Classification, Image Captioning | Unverified 0
Soundify: Matching Sound Effects to Video | Dec 17, 2021 | Audio Generation, Image Classification | Unverified 0
LiT: Zero-Shot Transfer with Locked-image text Tuning | Nov 15, 2021 | Image Classification | Code Available 1
FILIP: Fine-grained Interactive Language-Image Pre-Training | Nov 9, 2021 | Image Classification | Code Available 1
Benchmarking Knowledge-driven Zero-shot Learning | Jun 29, 2021 | Attribute, Benchmarking | Code Available 1
Zero-sample surface defect detection and classification based on semantic feedback neural network | Jun 15, 2021 | Attribute, Defect Detection | Unverified 0