| Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free | May 10, 2025 | Attribute, Mixture-of-Experts | Code Available | 4 | 5 |
| Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Jun 11, 2024 | 4k, Language Modeling | Code Available | 4 | 5 |
| BlackMamba: Mixture of Experts for State-Space Models | Feb 1, 2024 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis | May 23, 2024 | Image Generation, Mamba | Code Available | 3 | 5 |
| Mambular: A Sequential Model for Tabular Deep Learning | Aug 12, 2024 | Deep Learning, Mamba | Code Available | 3 | 5 |
| Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Feb 6, 2024 | In-Context Learning, Language Modeling | Code Available | 3 | 5 |
| Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Jun 5, 2024 | Mamba, Medical Image Analysis | Code Available | 3 | 5 |
| CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction | Jan 2, 2025 | Mamba, State Space Models | Code Available | 3 | 5 |
| LocalMamba: Visual State Space Model with Windowed Selective Scan | Mar 14, 2024 | Mamba, State Space Models | Code Available | 3 | 5 |
| Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces | Feb 1, 2024 | Computational Efficiency, GPU | Code Available | 3 | 5 |