| PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts | Aug 2, 2023 | Classificationimage-classification | CodeCode Available | 1 |
| PRIOR: Prototype Representation Joint Learning from Medical Images and Reports | Jul 24, 2023 | Contrastive LearningImage to text | CodeCode Available | 1 |
| MineralImage5k: A benchmark for zero-shot raw mineral visual recognition and description | Jul 20, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 |
| UCAS-IIE-NLP at SemEval-2023 Task 12: Enhancing Generalization of Multilingual BERT for Low-resource Sentiment Analysis | Jun 1, 2023 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 |
| Improved Probabilistic Image-Text Representations | May 29, 2023 | Data AugmentationImage-text matching | CodeCode Available | 1 |
| Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models | May 29, 2023 | Image CaptioningImage Classification | CodeCode Available | 1 |
| Parts of Speech-Grounded Subspaces in Vision-Language Models | May 23, 2023 | Image GenerationPOS | CodeCode Available | 1 |
| S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions | May 23, 2023 | Contrastive LearningImage-text Retrieval | CodeCode Available | 1 |
| MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts | May 18, 2023 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 1 |
| The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks | Apr 26, 2023 | Data AugmentationLanguage Modelling | CodeCode Available | 1 |
| SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval) | Apr 13, 2023 | ClassificationSentiment Analysis | CodeCode Available | 1 |
| Exploring Vision-Language Models for Imbalanced Learning | Apr 4, 2023 | Decoderzero-shot-classification | CodeCode Available | 1 |
| Robust Contrastive Language-Image Pre-training against Data Poisoning and Backdoor Attacks | Mar 13, 2023 | Backdoor AttackData Poisoning | CodeCode Available | 1 |
| Teaching CLIP to Count to Ten | Feb 23, 2023 | counterfactualImage Generation | CodeCode Available | 1 |
| Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion | Feb 7, 2023 | ClassificationDiversity | CodeCode Available | 1 |
| CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets | Feb 6, 2023 | Classificationimage-classification | CodeCode Available | 1 |
| Learning Customized Visual Models with Retrieval-Augmented Knowledge | Jan 17, 2023 | Contrastive LearningRetrieval | CodeCode Available | 1 |
| Attentive Mask CLIP | Dec 16, 2022 | Contrastive LearningRetrieval | CodeCode Available | 1 |
| Reproducible scaling laws for contrastive language-image learning | Dec 14, 2022 | Image ClassificationOpen Vocabulary Attribute Detection | CodeCode Available | 1 |
| LidarCLIP or: How I Learned to Talk to Point Clouds | Dec 13, 2022 | Image GenerationRetrieval | CodeCode Available | 1 |
| Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning | Dec 9, 2022 | Contrastive Learningimage-classification | CodeCode Available | 1 |
| SuS-X: Training-Free Name-Only Transfer of Vision-Language Models | Nov 28, 2022 | Retrievalzero-shot-classification | CodeCode Available | 1 |
| Visual Classification via Description from Large Language Models | Oct 13, 2022 | ClassificationDescriptive | CodeCode Available | 1 |
| CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention | Sep 28, 2022 | Training-free 3D Point Cloud ClassificationTransfer Learning | CodeCode Available | 1 |
| Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks | Aug 8, 2022 | Image GenerationText to Image Generation | CodeCode Available | 1 |