| Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Feb 16, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 2 |
| Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling | Feb 15, 2024 | MambaPrediction | CodeCode Available | 1 |
| Graph Mamba: Towards Learning on Graphs with State Space Models | Feb 13, 2024 | Graph Representation LearningMamba | CodeCode Available | 0 |
| Scalable Diffusion Models with State Space Backbone | Feb 8, 2024 | Conditional Image GenerationImage Generation | CodeCode Available | 2 |
| Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data | Feb 8, 2024 | Action RecognitionMamba | CodeCode Available | 2 |
| On Provable Length and Compositional Generalization | Feb 7, 2024 | DiversityOut-of-Distribution Generalization | CodeCode Available | 0 |
| Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Feb 6, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 3 |
| Importance sampling for online variational learning | Feb 5, 2024 | State Space Models | —Unverified | 0 |
| Contingency Detection in Modern Power Systems: A Stochastic Hybrid System Method | Feb 2, 2024 | State Space Models | —Unverified | 0 |
| BlackMamba: Mixture of Experts for State-Space Models | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |