| Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning | Mar 3, 2022 | Contrastive LearningFairness | CodeCode Available | 1 |
| Zero-Shot and Few-Shot Classification of Biomedical Articles in Context of the COVID-19 Pandemic | Jan 9, 2022 | ArticlesMulti-Task Learning | —Unverified | 0 |
| A Fistful of Words: Learning Transferable Visual Models from Bag-of-Words Supervision | Dec 27, 2021 | ClassificationImage Captioning | —Unverified | 0 |
| Learning Aligned Cross-Modal Representation for Generalized Zero-Shot Classification | Dec 24, 2021 | Classificationzero-shot-classification | —Unverified | 0 |
| Decoupling Zero-Shot Semantic Segmentation | Dec 15, 2021 | Open Vocabulary Semantic SegmentationSegmentation | CodeCode Available | 1 |
| CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision | Dec 14, 2021 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 |
| 3D Compositional Zero-shot Learning with DeCompositional Consensus | Nov 29, 2021 | BenchmarkingCompositional Zero-Shot Learning | —Unverified | 0 |
| Make an Omelette with Breaking Eggs: Zero-Shot Learning for Novel Attribute Synthesis | Nov 28, 2021 | AttributeClassification | —Unverified | 0 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Wav2CLIP: Learning Robust Audio Representations From CLIP | Oct 21, 2021 | Cross-Modal RetrievalImage Generation | CodeCode Available | 1 |