| Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning | May 22, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| EcoSafeRAG: Efficient Security through Context Analysis in Retrieval-Augmented Generation | May 16, 2025 | DiversityRAG | —Unverified | 0 | 0 |
| Dynamic Token Reduction during Generation for Vision Language Models | Jan 24, 2025 | DecoderToken Reduction | —Unverified | 0 | 0 |
| Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration | Nov 26, 2024 | Token Reduction | —Unverified | 0 | 0 |
| ZipR1: Reinforcing Token Sparsity in MLLMs | Apr 23, 2025 | Token Reduction | —Unverified | 0 | 0 |
| PAR: Prompt-Aware Token Reduction Method for Efficient Large Multimodal Models | Oct 9, 2024 | Question AnsweringRetrieval | —Unverified | 0 | 0 |
| Selective Structured State-Spaces for Long-Form Video Understanding | Mar 25, 2023 | Contrastive LearningForm | —Unverified | 0 | 0 |
| Does Acceleration Cause Hidden Instability in Vision Language Models? Uncovering Instance-Level Divergence Through a Large-Scale Empirical Study | Mar 9, 2025 | QuantizationToken Reduction | —Unverified | 0 | 0 |
| Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction | Nov 30, 2024 | Bayesian OptimizationToken Reduction | —Unverified | 0 | 0 |
| STAR: Stage-Wise Attention-Guided Token Reduction for Efficient Large Vision-Language Models Inference | May 18, 2025 | Token Reduction | —Unverified | 0 | 0 |