| Label Propagation for Zero-shot Classification with Vision-Language Models | Apr 5, 2024 | ClassificationImage Classification | CodeCode Available | 1 |
| Training-Free Semantic Segmentation via LLM-Supervision | Mar 31, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VLM-CPL: Consensus Pseudo Labels from Vision-Language Models for Human Annotation-Free Pathological Image Classification | Mar 23, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Long-CLIP: Unlocking the Long-Text Capability of CLIP | Mar 22, 2024 | Image GenerationImage Retrieval | CodeCode Available | 4 |
| CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Mar 19, 2024 | DecoderInstance Segmentation | CodeCode Available | 1 |
| Audio-Visual Compound Expression Recognition Method based on Late Modality Fusion and Rule-based Decision | Mar 19, 2024 | Cross-corpusEmotion Recognition | —Unverified | 0 |
| MEDBind: Unifying Language and Multimodal Medical Data Embeddings | Mar 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition | Mar 19, 2024 | Dense CaptioningImage Captioning | —Unverified | 0 |
| Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | Mar 14, 2024 | Continual LearningKnowledge Distillation | —Unverified | 0 |
| MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions | Mar 12, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |