| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object Detection, Object | Code Available | 1 |
| Telling Stories for Common Sense Zero-Shot Action Recognition | Sep 29, 2023 | Action Recognition, Articles | Code Available | 0 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image Segmentation, Object Localization | Code Available | 1 |
| Exploiting CLIP-based Multi-modal Approach for Artwork Classification and Retrieval | Sep 21, 2023 | Retrieval, zero-shot-classification | —Unverified | 0 |
| Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning | Sep 20, 2023 | Audio captioning, Caption Generation | —Unverified | 0 |
| TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification | Sep 13, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 1 |
| Zero-Shot Visual Classification with Guided Cropping | Sep 12, 2023 | Classification, Object | —Unverified | 0 |
| Mitigating Word Bias in Zero-shot Prompt-based Classifiers | Sep 10, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 |
| Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment | Sep 8, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| ETP: Learning Transferable ECG Representations via ECG-Text Pre-training | Sep 6, 2023 | Diagnostic, Language Modeling | —Unverified | 0 |