| Improving Transformer Optimization Through Better Initialization | Jan 1, 2020 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improving Visual Commonsense in Language Models via Multiple Image Generation | Jun 19, 2024 | Common Sense ReasoningImage Generation | CodeCode Available | 1 | 5 |
| Finetuning Pretrained Transformers into Variational Autoencoders | Aug 5, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining | Oct 10, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Aligning Knowledge Concepts to Whole Slide Images for Precise Histopathology Image Analysis | Nov 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression | Sep 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | DecoderDenoising | CodeCode Available | 1 | 5 |