| Content-aware Token Sharing for Efficient Semantic Segmentation with Vision Transformers | Jun 3, 2023 | Computational Efficiency, image-classification | Code Available | 1 | 5 |
| Learning Compact Vision Tokens for Efficient Large Multimodal Models | Jun 8, 2025 | Multimodal Reasoning, Token Reduction | Code Available | 1 | 5 |
| Bridging Local Details and Global Context in Text-Attributed Graphs | Jun 18, 2024 | Representation Learning, Token Reduction | Code Available | 1 | 5 |
| FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model | Oct 3, 2024 | Emotion Recognition, Language Modeling | Code Available | 1 | 5 |
| ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers | Jun 14, 2024 | Segmentation, Semantic Segmentation | Code Available | 1 | 5 |
| Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training | Dec 17, 2024 | Mamba, Token Reduction | Code Available | 1 | 5 |
| Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs | Sep 17, 2024 | Question Answering, Token Reduction | Code Available | 1 | 5 |
| PuMer: Pruning and Merging Tokens for Efficient Vision Language Models | May 27, 2023 | Token Reduction | Code Available | 1 | 5 |
| Token Cropr: Faster ViTs for Quite a Few Tasks | Dec 1, 2024 | image-classification, Image Classification | Code Available | 1 | 5 |
| Window Token Concatenation for Efficient Visual Large Language Models | Apr 5, 2025 | Token Reduction | Code Available | 1 | 5 |