| BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | Oct 11, 2018 | Citation Intent Classification, Common Sense Reasoning | Code Available | 3 |
| Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer | Oct 23, 2019 | Answer Generation, Common Sense Reasoning | Code Available | 2 |
| MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation | Nov 10, 2022 | Multimodal Intent Recognition, Retrieval | Code Available | 2 |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Sep 26, 2019 | Common Sense Reasoning, GPU | Code Available | 2 |
| MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations | Mar 16, 2024 | Intent Recognition, Multimodal Intent Recognition | Code Available | 2 |
| ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | Feb 5, 2021 | Cross-Modal Retrieval, Image Retrieval | Code Available | 1 |
| MIntRec: A New Dataset for Multimodal Intent Recognition | Sep 9, 2022 | Intent Recognition, Multimodal Intent Recognition | Code Available | 1 |
| Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition | Dec 22, 2023 | Contrastive Learning, Intent Recognition | Code Available | 1 |
| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive Learning, Intent Recognition | Unverified | 0 |
| 0/1 Deep Neural Networks via Block Coordinate Descent | Jun 19, 2022 | 10-shot image generation | Unverified | 0 |
| WDMIR: Wavelet-Driven Multimodal Intent Recognition | May 27, 2025 | Intent Recognition, Multimodal Intent Recognition | Unverified | 0 |
| EMOE: Modality-Specific Enhanced Dynamic Emotion Experts | Jan 1, 2025 | Emotion Recognition, Intent Recognition | Unverified | 0 |
| Contextual Augmented Global Contrast for Multimodal Intent Recognition | Jan 1, 2024 | Contrastive Learning, Intent Recognition | Unverified | 0 |
| TECO: Improving Multimodal Intent Recognition with Text Enhancement through Commonsense Knowledge Extraction | Dec 11, 2024 | Intent Recognition, Multimodal Intent Recognition | Unverified | 0 |
| PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | May 24, 2023 | Dialogue State Tracking, Image Retrieval | Code Available | 0 |
| Speech-Text Dialog Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment | May 19, 2023 | cross-modal alignment, Emotion Recognition in Conversation | Code Available | 0 |