| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | Decoder, Denoising | Code Available | 1 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language Modeling | Code Available | 1 |
| MedualTime: A Dual-Adapter Language Model for Medical Time Series-Text Multimodal Learning | Jun 7, 2024 | Contrastive Learning, Language Modeling | Code Available | 1 |
| Dual Rectified Linear Units (DReLUs): A Replacement for Tanh Activation Functions in Quasi-Recurrent Neural Networks | Jul 25, 2017 | Language Modeling | Code Available | 1 |
| Probabilistic Generative Transformer Language models for Generative Design of Molecules | Sep 20, 2022 | Language Modeling | Code Available | 1 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling | Code Available | 1 |
| DUnE: Dataset for Unified Editing | Nov 27, 2023 | Language Modeling | Code Available | 1 |
| UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Aug 5, 2024 | Language Modeling | Code Available | 1 |
| DUMA: Reading Comprehension with Transposition Thinking | Jan 26, 2020 | Language Modeling | Code Available | 1 |
| DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities | Feb 16, 2025 | Automatic Speech Recognition (ASR) | Code Available | 1 |