| FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models | May 26, 2025 | Token Reduction | Code Available | 1 |
| CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms | May 22, 2025 | Token Reduction | Code Available | 1 |
| Streamline Without Sacrifice -- Squeeze out Computation Redundancy in LMM | May 21, 2025 | Decoder, Token Reduction | Code Available | 1 |
| Window Token Concatenation for Efficient Visual Large Language Models | Apr 5, 2025 | Token Reduction | Code Available | 1 |
| FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance | Jan 5, 2025 | Token Reduction | Code Available | 1 |
| Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training | Dec 17, 2024 | Mamba, Token Reduction | Code Available | 1 |
| Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation | Dec 13, 2024 | Token Reduction | Code Available | 1 |
| Token Cropr: Faster ViTs for Quite a Few Tasks | Dec 1, 2024 | Image Classification | Code Available | 1 |
| Inference Optimal VLMs Need Fewer Visual Tokens and More Parameters | Nov 5, 2024 | Token Reduction, Visual Reasoning | Code Available | 1 |
| Rethinking Token Reduction for State Space Models | Oct 16, 2024 | Mamba, State Space Models | Code Available | 1 |