| Contrastive Audio-Visual Masked Autoencoder | Oct 2, 2022 | Audio ClassificationAudio Tagging | CodeCode Available | 2 | 5 |
| PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization | Jul 27, 2023 | Domain GeneralizationImage Classification | CodeCode Available | 1 | 5 |
| Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification | Jan 1, 2022 | ClassificationInformativeness | CodeCode Available | 1 | 5 |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Mar 4, 2023 | Cross-Modal RetrievalImage Captioning | CodeCode Available | 1 | 5 |
| Multimodal Learning with Uncertainty Quantification based on Discounted Belief Fusion | Dec 23, 2024 | Decision MakingMulti-modal Classification | CodeCode Available | 1 | 5 |
| Multi-modal Sarcasm Detection and Humor Classification in Code-mixed Conversations | May 20, 2021 | ClassificationMulti-modal Classification | CodeCode Available | 1 | 5 |
| UAVM: Towards Unifying Audio and Visual Models | Jul 29, 2022 | Audio Classificationaudio-visual learning | CodeCode Available | 1 | 5 |
| On Modality Bias Recognition and Reduction | Feb 25, 2022 | Action RecognitionMulti-modal Classification | CodeCode Available | 0 | 5 |
| Image and Encoded Text Fusion for Multi-Modal Classification | Oct 3, 2018 | ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| Hateful Meme Detection through Context-Sensitive Prompting and Fine-Grained Labeling | Nov 13, 2024 | Model OptimizationMulti-modal Classification | CodeCode Available | 0 | 5 |