| TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation | Dec 10, 2024 | General KnowledgeText Generation | —Unverified | 0 | 0 |
| Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers | Mar 14, 2025 | GPUMamba | —Unverified | 0 | 0 |
| Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer | Aug 30, 2024 | Token Reduction | —Unverified | 0 | 0 |
| VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models | May 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Knowing When to Stop: Dynamic Context Cutoff for Large Language Models | Feb 3, 2025 | Token Reduction | —Unverified | 0 | 0 |
| AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration | Dec 16, 2024 | DenoisingToken Reduction | —Unverified | 0 | 0 |
| ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition | Dec 21, 2024 | Efficient ViTsToken Reduction | —Unverified | 0 | 0 |
| Learning Free Token Reduction for Multi-Modal Large Language Models | Jan 29, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |