| HMT: Hierarchical Memory Transformer for Long Context Language Processing | May 9, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Knowledge Circuits in Pretrained Transformers | May 28, 2024 | In-Context Learning, knowledge editing | Code Available | 2 | 5 |
| Huatuo-26M, a Large-scale Chinese Medical QA Dataset | May 2, 2023 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| A Touch, Vision, and Language Dataset for Multimodal Alignment | Feb 20, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | Dec 28, 2022 | 8k, Coreference Resolution | Code Available | 2 | 5 |
| KV Shifting Attention Enhances Language Modeling | Nov 29, 2024 | In-Context Learning, Language Modeling | Code Available | 2 | 5 |
| Improving Factuality and Reasoning in Language Models through Multiagent Debate | May 23, 2023 | Few-Shot Learning, Language Modeling | Code Available | 2 | 5 |
| Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions | Sep 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 | 5 |
| LLaSM: Large Language and Speech Model | Aug 30, 2023 | Instruction Following, Language Modeling | Code Available | 2 | 5 |
| SOLO: A Single Transformer for Scalable Vision-Language Modeling | Jul 8, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |