| S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions | May 23, 2023 | Contrastive Learning, Image-text Retrieval | Code Available | 1 |
| Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science | May 23, 2023 | zero-shot-classification, Zero-Shot Learning | Unverified | 0 |
| Parts of Speech-Grounded Subspaces in Vision-Language Models | May 23, 2023 | Image Generation, POS | Code Available | 1 |
| LLM-Pruner: On the Structural Pruning of Large Language Models | May 19, 2023 | Text Generation, zero-shot-classification | Code Available | 3 |
| MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts | May 18, 2023 | Medical Visual Question Answering, Question Answering | Code Available | 1 |
| ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding | May 14, 2023 | 3D Classification, 3D Point Cloud Classification | Code Available | 2 |
| Boosting Visual-Language Models by Exploiting Hard Samples | May 9, 2023 | Retrieval, zero-shot-classification | Code Available | 0 |
| The Benefits of Label-Description Training for Zero-Shot Text Classification | May 3, 2023 | Classification, domain classification | Code Available | 0 |
| Unsupervised Improvement of Audio-Text Cross-Modal Representations | May 3, 2023 | Acoustic Scene Classification, Classification | Code Available | 0 |
| The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks | Apr 26, 2023 | Data Augmentation, Language Modelling | Code Available | 1 |
| CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval | Apr 21, 2023 | Data Augmentation, Information Retrieval | Code Available | 0 |
| WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval | Apr 14, 2023 | Retrieval, zero-shot-classification | Code Available | 0 |
| SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval) | Apr 13, 2023 | Classification, Sentiment Analysis | Code Available | 1 |
| What does CLIP know about a red circle? Visual prompt engineering for VLMs | Apr 13, 2023 | Image Generation, Prompt Engineering | Unverified | 0 |
| RECLIP: Resource-efficient CLIP by Training with Small Images | Apr 12, 2023 | Contrastive Learning, Image-text Retrieval | Unverified | 0 |
| Exploring Vision-Language Models for Imbalanced Learning | Apr 4, 2023 | Decoder, zero-shot-classification | Code Available | 1 |
| SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger | Mar 30, 2023 | cross-modal alignment, zero-shot-classification | Unverified | 0 |
| Your Diffusion Model is Secretly a Zero-Shot Classifier | Mar 28, 2023 | Domain Generalization, Fine-Grained Image Classification | Code Available | 2 |
| Evaluation of ChatGPT for NLP-based Mental Health Applications | Mar 28, 2023 | Classification, Depression Detection | Unverified | 0 |
| Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection | Mar 25, 2023 | Decoder, object-detection | Unverified | 0 |
| Frozen Language Model Helps ECG Zero-Shot Learning | Mar 22, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification | Mar 13, 2023 | Job classification, Prompt Engineering | Unverified | 0 |
| Robust Contrastive Language-Image Pre-training against Data Poisoning and Backdoor Attacks | Mar 13, 2023 | Backdoor Attack, Data Poisoning | Code Available | 1 |
| Exploiting the Textual Potential from Vision-Language Pre-training for Text-based Person Search | Mar 8, 2023 | Attribute, Person Search | Unverified | 0 |
| Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions | Mar 7, 2023 | nlg evaluation, Representation Learning | Code Available | 0 |