| Pseudo-Prompt Generating in Pre-trained Vision-Language Models for Multi-Label Medical Image Classification | May 10, 2024 | Decoder, image-classification | Code Available | 1 |
| CLIPArTT: Adaptation of CLIP to New Domains at Test Time | May 1, 2024 | Pseudo Label, Test-time Adaptation | Code Available | 1 |
| Modeling Caption Diversity in Contrastive Vision-Language Pretraining | Apr 30, 2024 | Diversity, zero-shot-classification | Code Available | 1 |
| OpenDlign: Open-World Point Cloud Understanding with Depth-Aligned Images | Apr 25, 2024 | Representation Learning, Transfer Learning | Code Available | 1 |
| Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Apr 15, 2024 | Cross-Modal Retrieval, Language Modeling | Code Available | 1 |
| Label Propagation for Zero-shot Classification with Vision-Language Models | Apr 5, 2024 | Classification, Image Classification | Code Available | 1 |
| VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification | Mar 23, 2024 | image-classification, Image Classification | Code Available | 1 |
| CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Mar 19, 2024 | Decoder, Instance Segmentation | Code Available | 1 |
| MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions | Mar 12, 2024 | Domain Adaptation, Language Modeling | Code Available | 1 |
| CLIP-Guided Source-Free Object Detection in Aerial Images | Jan 10, 2024 | Domain Adaptation, Object | Code Available | 1 |
| Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions | Jan 4, 2024 | Fine-Grained Image Classification, image-classification | Code Available | 1 |
| Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges | Dec 12, 2023 | 3D Object Classification, Classification | Code Available | 1 |
| Lite-Mind: Towards Efficient and Robust Brain Representation Network | Dec 6, 2023 | Brain Decoding, Image Retrieval | Code Available | 1 |
| SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference | Dec 4, 2023 | Segmentation, Semantic Segmentation | Code Available | 1 |
| ViT-Lens: Towards Omni-modal Representations | Nov 27, 2023 | EEG, Image Generation | Code Available | 1 |
| Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection | Nov 1, 2023 | Classification, Few-Shot Object Detection | Code Available | 1 |
| ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Oct 27, 2023 | Column Type Annotation, Table annotation | Code Available | 1 |
| EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition | Oct 25, 2023 | Facial Expression Recognition, Facial Expression Recognition (FER) | Code Available | 1 |
| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object Detection, Object | Code Available | 1 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image Segmentation, Object Localization | Code Available | 1 |
| TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification | Sep 13, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 1 |
| Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment | Aug 24, 2023 | Self-Learning, zero-shot-classification | Code Available | 1 |
| Adversarial Illusions in Multi-Modal Embeddings | Aug 22, 2023 | Image Generation, Text Generation | Code Available | 1 |
| Image-free Classifier Injection for Zero-Shot Classification | Aug 21, 2023 | Classification, Decoder | Code Available | 1 |
| ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation | Aug 4, 2023 | Domain Adaptation, image-classification | Code Available | 1 |
| PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts | Aug 2, 2023 | Classification, image-classification | Code Available | 1 |
| PRIOR: Prototype Representation Joint Learning from Medical Images and Reports | Jul 24, 2023 | Contrastive Learning, Image to text | Code Available | 1 |
| MineralImage5k: A benchmark for zero-shot raw mineral visual recognition and description | Jul 20, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 1 |
| UCAS-IIE-NLP at SemEval-2023 Task 12: Enhancing Generalization of Multilingual BERT for Low-resource Sentiment Analysis | Jun 1, 2023 | Contrastive Learning, Representation Learning | Code Available | 1 |
| Improved Probabilistic Image-Text Representations | May 29, 2023 | Data Augmentation, Image-text matching | Code Available | 1 |
| Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models | May 29, 2023 | Image Captioning, Image Classification | Code Available | 1 |
| Parts of Speech-Grounded Subspaces in Vision-Language Models | May 23, 2023 | Image Generation, POS | Code Available | 1 |
| S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions | May 23, 2023 | Contrastive Learning, Image-text Retrieval | Code Available | 1 |
| MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts | May 18, 2023 | Medical Visual Question Answering, Question Answering | Code Available | 1 |
| The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks | Apr 26, 2023 | Data Augmentation, Language Modelling | Code Available | 1 |
| SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval) | Apr 13, 2023 | Classification, Sentiment Analysis | Code Available | 1 |
| Exploring Vision-Language Models for Imbalanced Learning | Apr 4, 2023 | Decoder, zero-shot-classification | Code Available | 1 |
| Robust Contrastive Language-Image Pre-training against Data Poisoning and Backdoor Attacks | Mar 13, 2023 | Backdoor Attack, Data Poisoning | Code Available | 1 |
| Teaching CLIP to Count to Ten | Feb 23, 2023 | counterfactual, Image Generation | Code Available | 1 |
| Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion | Feb 7, 2023 | Classification, Diversity | Code Available | 1 |
| CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets | Feb 6, 2023 | Classification, image-classification | Code Available | 1 |
| Learning Customized Visual Models with Retrieval-Augmented Knowledge | Jan 17, 2023 | Contrastive Learning, Retrieval | Code Available | 1 |
| Attentive Mask CLIP | Dec 16, 2022 | Contrastive Learning, Retrieval | Code Available | 1 |
| Reproducible scaling laws for contrastive language-image learning | Dec 14, 2022 | Image Classification, Open Vocabulary Attribute Detection | Code Available | 1 |
| LidarCLIP or: How I Learned to Talk to Point Clouds | Dec 13, 2022 | Image Generation, Retrieval | Code Available | 1 |
| Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning | Dec 9, 2022 | Contrastive Learning, image-classification | Code Available | 1 |
| SuS-X: Training-Free Name-Only Transfer of Vision-Language Models | Nov 28, 2022 | Retrieval, zero-shot-classification | Code Available | 1 |
| Visual Classification via Description from Large Language Models | Oct 13, 2022 | Classification, Descriptive | Code Available | 1 |
| CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention | Sep 28, 2022 | Training-free 3D Point Cloud Classification, Transfer Learning | Code Available | 1 |
| Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks | Aug 8, 2022 | Image Generation, Text to Image Generation | Code Available | 1 |