| Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection | Nov 1, 2023 | Classification, Few-Shot Object Detection | Code Available | 1 |
| Using Large Language Models to Support Thematic Analysis in Empirical Legal Studies | Oct 28, 2023 | Language Modelling, Large Language Model | Unverified | 0 |
| ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Oct 27, 2023 | Column Type Annotation, Table annotation | Code Available | 1 |
| EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition | Oct 25, 2023 | Facial Expression Recognition, Facial Expression Recognition (FER) | Code Available | 1 |
| Linear Representations of Sentiment in Large Language Models | Oct 23, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 |
| CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement | Oct 21, 2023 | Depth Estimation, image-classification | Unverified | 0 |
| SILC: Improving Vision Language Pretraining with Self-Distillation | Oct 20, 2023 | Classification, Contrastive Learning | Unverified | 0 |
| MedAI Dialog Corpus (MEDIC): Zero-Shot Classification of Doctor and AI Responses in Health Consultations | Oct 19, 2023 | Classification, text-classification | Unverified | 0 |
| Evaluating the Fairness of Discriminative Foundation Models in Computer Vision | Oct 18, 2023 | Fairness, Image Captioning | Code Available | 0 |
| Estimating Uncertainty in Multimodal Foundation Models using Public Internet Data | Oct 15, 2023 | Conformal Prediction, Prediction | Code Available | 0 |
| VeCLIP: Improving CLIP Training via Visual-enriched Captions | Oct 11, 2023 | Image-text Retrieval, Retrieval | Code Available | 2 |
| Uni3D: Exploring Unified 3D Representation at Scale | Oct 10, 2023 | 3D Object Classification, Retrieval | Code Available | 2 |
| Blind Dates: Examining the Expression of Temporality in Historical Photographs | Oct 10, 2023 | zero-shot-classification, Zero-Shot Learning | Unverified | 0 |
| Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift | Oct 8, 2023 | Contrastive Learning, zero-shot-classification | Unverified | 0 |
| Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks | Oct 5, 2023 | Contrastive Learning, Data Poisoning | Code Available | 0 |
| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object Detection, Object | Code Available | 1 |
| Telling Stories for Common Sense Zero-Shot Action Recognition | Sep 29, 2023 | Action Recognition, Articles | Code Available | 0 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image Segmentation, Object Localization | Code Available | 1 |
| Exploiting CLIP-based Multi-modal Approach for Artwork Classification and Retrieval | Sep 21, 2023 | Retrieval, zero-shot-classification | Unverified | 0 |
| Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning | Sep 20, 2023 | Audio captioning, Caption Generation | Unverified | 0 |
| TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification | Sep 13, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 1 |
| Zero-Shot Visual Classification with Guided Cropping | Sep 12, 2023 | Classification, Object | Unverified | 0 |
| Mitigating Word Bias in Zero-shot Prompt-based Classifiers | Sep 10, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 |
| Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment | Sep 8, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| ETP: Learning Transferable ECG Representations via ECG-Text Pre-training | Sep 6, 2023 | Diagnostic, Language Modeling | Unverified | 0 |