| DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs | Apr 23, 2025 | Token ReductionVideo Understanding | —Unverified | 0 | 0 |
| The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training | May 25, 2025 | Reinforcement Learning (RL)Token Reduction | —Unverified | 0 | 0 |
| DRP: Distilled Reasoning Pruning with Skill-aware Step Decomposition for Efficient Large Reasoning Models | May 20, 2025 | GSM8KMathematical Reasoning | —Unverified | 0 | 0 |
| Token Dynamics: Towards Efficient and Dynamic Video Token Representation for Video Large Language Models | Mar 21, 2025 | Computational EfficiencyToken Reduction | —Unverified | 0 | 0 |
| TPC-ViT: Token Propagation Controller for Efficient Vision Transformer | Jan 3, 2024 | Token Reduction | —Unverified | 0 | 0 |
| Deploying Foundation Model Powered Agent Services: A Survey | Dec 18, 2024 | modelModel Compression | —Unverified | 0 | 0 |
| Token Transforming: A Unified and Training-Free Token Compression Framework for Vision Transformer Acceleration | Jun 6, 2025 | Depth Estimationobject-detection | —Unverified | 0 | 0 |
| TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer | Nov 19, 2022 | 3D geometryHuman Mesh Recovery | —Unverified | 0 | 0 |
| Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings | Jun 5, 2025 | RetrievalToken Reduction | —Unverified | 0 | 0 |
| Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems | Oct 3, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |