| Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification | May 10, 2024 | Decoderimage-classification | CodeCode Available | 1 |
| CLIPArTT: Adaptation of CLIP to New Domains at Test Time | May 1, 2024 | Pseudo LabelTest-time Adaptation | CodeCode Available | 1 |
| Modeling Caption Diversity in Contrastive Vision-Language Pretraining | Apr 30, 2024 | Diversityzero-shot-classification | CodeCode Available | 1 |
| OpenDlign: Open-World Point Cloud Understanding with Depth-Aligned Images | Apr 25, 2024 | Representation LearningTransfer Learning | CodeCode Available | 1 |
| Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Apr 15, 2024 | Cross-Modal RetrievalLanguage Modeling | CodeCode Available | 1 |
| Label Propagation for Zero-shot Classification with Vision-Language Models | Apr 5, 2024 | ClassificationImage Classification | CodeCode Available | 1 |
| VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification | Mar 23, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Mar 19, 2024 | DecoderInstance Segmentation | CodeCode Available | 1 |
| MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions | Mar 12, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| CLIP-Guided Source-Free Object Detection in Aerial Images | Jan 10, 2024 | Domain AdaptationObject | CodeCode Available | 1 |
| Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions | Jan 4, 2024 | Fine-Grained Image Classificationimage-classification | CodeCode Available | 1 |
| Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges | Dec 12, 2023 | 3D Object ClassificationClassification | CodeCode Available | 1 |
| Lite-Mind: Towards Efficient and Robust Brain Representation Network | Dec 6, 2023 | Brain DecodingImage Retrieval | CodeCode Available | 1 |
| SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference | Dec 4, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| ViT-Lens: Towards Omni-modal Representations | Nov 27, 2023 | EEGImage Generation | CodeCode Available | 1 |
| Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection | Nov 1, 2023 | ClassificationFew-Shot Object Detection | CodeCode Available | 1 |
| ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Oct 27, 2023 | Column Type AnnotationTable annotation | CodeCode Available | 1 |
| EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition | Oct 25, 2023 | Facial Expression RecognitionFacial Expression Recognition (FER) | CodeCode Available | 1 |
| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object DetectionObject | CodeCode Available | 1 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image SegmentationObject Localization | CodeCode Available | 1 |
| TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification | Sep 13, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 |
| Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment | Aug 24, 2023 | Self-Learningzero-shot-classification | CodeCode Available | 1 |
| Adversarial Illusions in Multi-Modal Embeddings | Aug 22, 2023 | Image GenerationText Generation | CodeCode Available | 1 |
| Image-free Classifier Injection for Zero-Shot Classification | Aug 21, 2023 | ClassificationDecoder | CodeCode Available | 1 |
| ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation | Aug 4, 2023 | Domain Adaptationimage-classification | CodeCode Available | 1 |