| Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Apr 15, 2024 | Cross-Modal RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | DenoisingDescriptive | CodeCode Available | 1 | 5 |
| Adversarial Robustification via Text-to-Image Diffusion Models | Jul 26, 2024 | Adversarial Robustnesszero-shot-classification | CodeCode Available | 1 | 5 |
| Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment | Aug 24, 2023 | Self-Learningzero-shot-classification | CodeCode Available | 1 | 5 |
| CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention | Sep 28, 2022 | Training-free 3D Point Cloud ClassificationTransfer Learning | CodeCode Available | 1 | 5 |
| Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Mar 17, 2020 | Action ClassificationClassification | CodeCode Available | 1 | 5 |
| Discriminative Region-based Multi-Label Zero-Shot Learning | Aug 20, 2021 | Image RetrievalMulti-label zero-shot learning | CodeCode Available | 1 | 5 |
| CountCLIP -- [Re] Teaching CLIP to Count to Ten | Jun 5, 2024 | zero-shot-classificationZero-Shot Counting | CodeCode Available | 1 | 5 |
| Differentiable Model Scaling using Differentiable Topk | May 12, 2024 | GPUimage-classification | CodeCode Available | 1 | 5 |
| Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges | Dec 12, 2023 | 3D Object ClassificationClassification | CodeCode Available | 1 | 5 |
| Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion | Feb 7, 2023 | ClassificationDiversity | CodeCode Available | 1 | 5 |
| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object DetectionObject | CodeCode Available | 1 | 5 |
| Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition | Jun 13, 2024 | Retrievalzero-shot-classification | CodeCode Available | 1 | 5 |
| Modeling Caption Diversity in Contrastive Vision-Language Pretraining | Apr 30, 2024 | Diversityzero-shot-classification | CodeCode Available | 1 | 5 |
| No Token Left Behind: Explainability-Aided Image Classification and Generation | Apr 11, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| DC3DO: Diffusion Classifier for 3D Objects | Aug 13, 2024 | 3D Object ClassificationClassification | CodeCode Available | 1 | 5 |
| RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Nov 6, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts | May 18, 2023 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 1 | 5 |
| TableTime: Reformulating Time Series Classification as Zero-Shot Table Understanding via Large Language Models | Nov 24, 2024 | Problem DecompositionTime Series | CodeCode Available | 1 | 5 |
| Design of the topology for contrastive visual-textual alignment | Sep 5, 2022 | Contrastive LearningImage-to-Text Retrieval | CodeCode Available | 0 | 5 |
| Mitigating Word Bias in Zero-shot Prompt-based Classifiers | Sep 10, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 0 | 5 |
| Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions | Mar 7, 2023 | nlg evaluationRepresentation Learning | CodeCode Available | 0 | 5 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Jan 29, 2024 | GPUzero-shot-classification | CodeCode Available | 0 | 5 |
| ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map | Jul 17, 2024 | Cross-Modal RetrievalDimensionality Reduction | CodeCode Available | 0 | 5 |
| DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Jul 14, 2025 | DecoderGPU | CodeCode Available | 0 | 5 |