| Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models | May 29, 2023 | Image CaptioningImage Classification | CodeCode Available | 1 |
| Improved Probabilistic Image-Text Representations | May 29, 2023 | Data AugmentationImage-text matching | CodeCode Available | 1 |
| Adapting Language-Audio Models as Few-Shot Audio Learners | May 28, 2023 | Audio ClassificationClassification | —Unverified | 0 |
| DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification | May 25, 2023 | 3D ClassificationClassification | —Unverified | 0 |
| OverPrompt: Enhancing ChatGPT through Efficient In-Context Learning | May 24, 2023 | Data AugmentationFact Checking | CodeCode Available | 0 |
| S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions | May 23, 2023 | Contrastive LearningImage-text Retrieval | CodeCode Available | 1 |
| Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science | May 23, 2023 | zero-shot-classificationZero-Shot Learning | —Unverified | 0 |
| Parts of Speech-Grounded Subspaces in Vision-Language Models | May 23, 2023 | Image GenerationPOS | CodeCode Available | 1 |
| LLM-Pruner: On the Structural Pruning of Large Language Models | May 19, 2023 | Text Generationzero-shot-classification | CodeCode Available | 3 |
| MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts | May 18, 2023 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 1 |