| XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection | Feb 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evolving Deep Neural Networks | Mar 1, 2017 | Deep LearningImage Captioning | CodeCode Available | 1 |
| Faster Causal Attention Over Large Sequences Through Sparse Flash Attention | Jun 1, 2023 | 16k8k | CodeCode Available | 1 |
| Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! | Feb 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Working Memory Capacity of ChatGPT: An Empirical Study | Apr 30, 2023 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Enabling Language Models to Fill in the Blanks | May 11, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Oct 1, 2024 | Continual LearningLanguage Modeling | CodeCode Available | 1 |
| Empowering Large Language Model Agents through Action Learning | Feb 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering | May 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators | Jun 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |