| The Benefits of Label-Description Training for Zero-Shot Text Classification | May 3, 2023 | Classification, domain classification | Code Available | 0 |
| Unsupervised Improvement of Audio-Text Cross-Modal Representations | May 3, 2023 | Acoustic Scene Classification, Classification | Code Available | 0 |
| CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval | Apr 21, 2023 | Data Augmentation, Information Retrieval | Code Available | 0 |
| WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval | Apr 14, 2023 | Retrieval, zero-shot-classification | Code Available | 0 |
| What does CLIP know about a red circle? Visual prompt engineering for VLMs | Apr 13, 2023 | Image Generation, Prompt Engineering | —Unverified | 0 |
| RECLIP: Resource-efficient CLIP by Training with Small Images | Apr 12, 2023 | Contrastive Learning, Image-text Retrieval | —Unverified | 0 |
| SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger | Mar 30, 2023 | cross-modal alignment, zero-shot-classification | —Unverified | 0 |
| Evaluation of ChatGPT for NLP-based Mental Health Applications | Mar 28, 2023 | Classification, Depression Detection | —Unverified | 0 |
| Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection | Mar 25, 2023 | Decoder, object-detection | —Unverified | 0 |
| Frozen Language Model Helps ECG Zero-Shot Learning | Mar 22, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |