| MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report | Oct 21, 2024 | Diagnostic, Medical Diagnosis | Code Available | 0 | 5 |
| Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions | Mar 7, 2023 | nlg evaluation, Representation Learning | Code Available | 0 | 5 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Jan 29, 2024 | GPU, zero-shot-classification | Code Available | 0 | 5 |
| ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map | Jul 17, 2024 | Cross-Modal Retrieval, Dimensionality Reduction | Code Available | 0 | 5 |
| DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Jul 14, 2025 | Decoder, GPU | Code Available | 0 | 5 |
| Mitigating Word Bias in Zero-shot Prompt-based Classifiers | Sep 10, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 | 5 |
| Data-Free Generalized Zero-Shot Learning | Jan 28, 2024 | Generalized Zero-Shot Learning, zero-shot-classification | Code Available | 0 | 5 |
| Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks | Oct 5, 2023 | Contrastive Learning, Data Poisoning | Code Available | 0 | 5 |
| Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes | May 31, 2023 | Classification, Contrastive Learning | Code Available | 0 | 5 |
| An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques | Dec 12, 2024 | Classification, image-classification | Code Available | 0 | 5 |