| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Long-Short Transformer: Efficient Transformers for Language and Vision | Jul 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling | Sep 25, 2024 | Data AugmentationDiversity | CodeCode Available | 1 | 5 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DALE: Generative Data Augmentation for Low-Resource Legal NLP | Oct 24, 2023 | Data AugmentationDecoder | CodeCode Available | 1 | 5 |
| Long-context Protein Language Modeling Using Bidirectional Mamba with Shared Projection Layers | Oct 29, 2024 | Drug DesignLanguage Modeling | CodeCode Available | 1 | 5 |
| UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Long Expressive Memory for Sequence Modeling | Oct 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |