| Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality | May 31, 2024 | Language Modeling, Language Modelling | Code Available | 11 | 5 |
| xLSTM: Extended Long Short-Term Memory | May 7, 2024 | Language Modeling, Language Modelling | Code Available | 7 | 5 |
| ThunderKittens: Simple, Fast, and Adorable AI Kernels | Oct 27, 2024 | GPU, State Space Models | Code Available | 7 | 5 |
| Mamba: Linear-Time Sequence Modeling with Selective State Spaces | Dec 1, 2023 | 2D Pose Estimation, Common Sense Reasoning | Code Available | 6 | 5 |
| Awesome Multi-modal Object Tracking | May 23, 2024 | Autonomous Driving, Knowledge Distillation | Code Available | 5 | 5 |
| ChangeMamba: Remote Sensing Change Detection With Spatiotemporal State Space Model | Apr 4, 2024 | 2D Semantic Segmentation, Attribute | Code Available | 4 | 5 |
| A Survey on Visual Mamba | Apr 24, 2024 | Image Registration, Image Restoration | Code Available | 4 | 5 |
| MedMamba: Vision Mamba for Medical Image Classification | Mar 6, 2024 | Classification, image-classification | Code Available | 4 | 5 |
| Mamba YOLO: A Simple Baseline for Object Detection with State Space Model | Jun 9, 2024 | GPU, Mamba | Code Available | 4 | 5 |
| Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free | May 10, 2025 | Attribute, Mixture-of-Experts | Code Available | 4 | 5 |
| Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length | Apr 12, 2024 | State Space Models | Code Available | 4 | 5 |
| Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Jun 11, 2024 | 4k, Language Modeling | Code Available | 4 | 5 |
| FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba | Apr 15, 2024 | Infrared And Visible Image Fusion, Mamba | Code Available | 3 | 5 |
| Repeat After Me: Transformers are Better than State Space Models at Copying | Feb 1, 2024 | State Space Models | Code Available | 3 | 5 |
| EfficientVMamba: Atrous Selective Scan for Light Weight Visual Mamba | Mar 15, 2024 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| MobileMamba: Lightweight Multi-Receptive Visual Mamba Network | Nov 24, 2024 | GPU, Mamba | Code Available | 3 | 5 |
| MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts | Jan 8, 2024 | Mamba, Mixture-of-Experts | Code Available | 3 | 5 |
| Mambular: A Sequential Model for Tabular Deep Learning | Aug 12, 2024 | Deep Learning, Mamba | Code Available | 3 | 5 |
| MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection | Apr 9, 2024 | Anomaly Detection, Decoder | Code Available | 3 | 5 |
| Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis | Jun 5, 2024 | Mamba, Medical Image Analysis | Code Available | 3 | 5 |
| BlackMamba: Mixture of Experts for State-Space Models | Feb 1, 2024 | Language Modeling, Language Modelling | Code Available | 3 | 5 |
| DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis | May 23, 2024 | Image Generation, Mamba | Code Available | 3 | 5 |
| Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Feb 6, 2024 | In-Context Learning, Language Modeling | Code Available | 3 | 5 |
| CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price Prediction | Jan 2, 2025 | Mamba, State Space Models | Code Available | 3 | 5 |
| Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces | Feb 1, 2024 | Computational Efficiency, GPU | Code Available | 3 | 5 |