| Lite-Mind: Towards Efficient and Robust Brain Representation Network | Dec 6, 2023 | Brain Decoding, Image Retrieval | Code Available | 1 |
| SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference | Dec 4, 2023 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Pipeline Enabling Zero-shot Classification for Bangla Handwritten Grapheme | Dec 1, 2023 | Bangla Text Detection, Classification | — Unverified | 0 |
| Explaining CLIP's performance disparities on data from blind/low vision users | Nov 29, 2023 | Few-Shot Learning, zero-shot-classification | — Unverified | 0 |
| MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training | Nov 28, 2023 | Image Captioning, Transfer Learning | — Unverified | 0 |
| IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers | Nov 27, 2023 | Caption Generation, Image-text Retrieval | — Unverified | 0 |
| ViT-Lens: Towards Omni-modal Representations | Nov 27, 2023 | EEG, Image Generation | Code Available | 1 |
| Effective Backdoor Mitigation in Vision-Language Models Depends on the Pre-training Objective | Nov 25, 2023 | zero-shot-classification, Zero-Shot Learning | — Unverified | 0 |
| tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models | Nov 24, 2023 | Audio Generation, Event Detection | — Unverified | 0 |
| Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data | Nov 23, 2023 | Data Integration, Sentiment Analysis | — Unverified | 0 |