| Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval | Aug 29, 2023 | Cross-Modal Retrievalimage-classification | —Unverified | 0 |
| Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment | Aug 24, 2023 | Self-Learningzero-shot-classification | CodeCode Available | 1 |
| Adversarial Illusions in Multi-Modal Embeddings | Aug 22, 2023 | Image GenerationText Generation | CodeCode Available | 1 |
| Image-free Classifier Injection for Zero-Shot Classification | Aug 21, 2023 | ClassificationDecoder | CodeCode Available | 1 |
| DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability | Aug 18, 2023 | Image Generationzero-shot-classification | —Unverified | 0 |
| Robustifying Point Cloud Networks by Refocusing | Aug 10, 2023 | 3D ClassificationAdversarial Defense | CodeCode Available | 0 |
| ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation | Aug 4, 2023 | Domain Adaptationimage-classification | CodeCode Available | 1 |
| PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts | Aug 2, 2023 | Classificationimage-classification | CodeCode Available | 1 |
| Developing and Evaluating Tiny to Medium-Sized Turkish BERT Models | Jul 26, 2023 | ClassificationComputational Efficiency | —Unverified | 0 |
| PRIOR: Prototype Representation Joint Learning from Medical Images and Reports | Jul 24, 2023 | Contrastive LearningImage to text | CodeCode Available | 1 |
| MineralImage5k: A benchmark for zero-shot raw mineral visual recognition and description | Jul 20, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 |
| Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP | Jul 18, 2023 | AttributeImage-text Retrieval | —Unverified | 0 |
| RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Jun 20, 2023 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 2 |
| RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Jun 19, 2023 | ClassificationCross-Modal Retrieval | CodeCode Available | 2 |
| Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation | Jun 14, 2023 | AttributeKnowledge Graphs | —Unverified | 0 |
| Improving Zero-Shot Detection of Low Prevalence Chest Pathologies using Domain Pre-trained Language Models | Jun 13, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 0 |
| Analysis of the Fed's communication by using textual entailment model of Zero-Shot classification | Jun 7, 2023 | Natural Language InferenceSentiment Analysis | —Unverified | 0 |
| UCAS-IIE-NLP at SemEval-2023 Task 12: Enhancing Generalization of Multilingual BERT for Low-resource Sentiment Analysis | Jun 1, 2023 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 |
| Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes | May 31, 2023 | ClassificationContrastive Learning | CodeCode Available | 0 |
| Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | May 31, 2023 | Decision MakingGeneral Knowledge | CodeCode Available | 2 |
| Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models | May 29, 2023 | Image CaptioningImage Classification | CodeCode Available | 1 |
| Improved Probabilistic Image-Text Representations | May 29, 2023 | Data AugmentationImage-text matching | CodeCode Available | 1 |
| Adapting Language-Audio Models as Few-Shot Audio Learners | May 28, 2023 | Audio ClassificationClassification | —Unverified | 0 |
| DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification | May 25, 2023 | 3D ClassificationClassification | —Unverified | 0 |
| OverPrompt: Enhancing ChatGPT through Efficient In-Context Learning | May 24, 2023 | Data AugmentationFact Checking | CodeCode Available | 0 |