| Revisiting Pre-Trained Models for Chinese Natural Language Processing | Apr 29, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Longformer: The Long-Document Transformer | Apr 10, 2020 | DecoderLanguage Modeling | CodeCode Available | 3 |
| Semi-Supervised Speech Recognition via Local Prior Matching | Feb 24, 2020 | Knowledge DistillationLanguage Modeling | CodeCode Available | 3 |
| Universal Language Model Fine-tuning for Text Classification | Jan 18, 2018 | General ClassificationLanguage Modeling | CodeCode Available | 3 |
| Order Matters: Sequence to sequence for sets | Nov 19, 2015 | Language Modeling | CodeCode Available | 3 |
| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Jul 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment | Jul 3, 2025 | cross-modal alignmentInstruction Following | CodeCode Available | 2 |
| OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Language Modeling by Language Models | Jun 25, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 2 |
| Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Jun 22, 2025 | DecoderImage Segmentation | CodeCode Available | 2 |