| Cached Adaptive Token Merging: Dynamic Token Reduction and Redundant Computation Elimination in Diffusion Model | Jan 1, 2025 | DenoisingToken Reduction | CodeCode Available | 0 |
| Cross-Layer Cache Aggregation for Token Reduction in Ultra-Fine-Grained Image Recognition | Dec 31, 2024 | Fine-Grained Image RecognitionToken Reduction | CodeCode Available | 0 |
| FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models | Dec 30, 2024 | Question AnsweringToken Reduction | CodeCode Available | 2 |
| ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition | Dec 21, 2024 | Efficient ViTsToken Reduction | —Unverified | 0 |
| Deploying Foundation Model Powered Agent Services: A Survey | Dec 18, 2024 | modelModel Compression | —Unverified | 0 |
| Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training | Dec 17, 2024 | MambaToken Reduction | CodeCode Available | 1 |
| AsymRnR: Video Diffusion Transformers Acceleration with Asymmetric Reduction and Restoration | Dec 16, 2024 | DenoisingToken Reduction | —Unverified | 0 |
| Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers | Dec 13, 2024 | Token Reduction | CodeCode Available | 0 |
| Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation | Dec 13, 2024 | Token Reduction | CodeCode Available | 1 |
| TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation | Dec 10, 2024 | General KnowledgeText Generation | —Unverified | 0 |