SOTAVerified|Agents Browse Leaderboard About Blog

Token Reduction

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 78 papers

Title	Date	Tasks	Status	Hype
DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs	Apr 23, 2025	Token ReductionVideo Understanding	—Unverified	0
Dynamic Compressing Prompts for Efficient Inference of Large Language Models	Apr 15, 2025	Token Reduction	CodeCode Available	0
PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models	Apr 11, 2025	ClusteringLanguage Modeling	CodeCode Available	2
Window Token Concatenation for Efficient Visual Large Language Models	Apr 5, 2025	Token Reduction	CodeCode Available	1
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features	Apr 1, 2025	Token Reduction	—Unverified	0
Local Information Matters: Inference Acceleration For Grounded Conversation Generation Models Through Adaptive Local-Aware Token Pruning	Mar 31, 2025	Semantic SegmentationToken Reduction	—Unverified	0
Faster Parameter-Efficient Tuning with Token Redundancy Reduction	Mar 26, 2025	Token Reduction	CodeCode Available	0
Token Dynamics: Towards Efficient and Dynamic Video Token Representation for Video Large Language Models	Mar 21, 2025	Computational EfficiencyToken Reduction	—Unverified	0
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers	Mar 14, 2025	GPUMamba	—Unverified	0
Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens	Mar 11, 2025	DecoderImage Generation	CodeCode Available	0

Show:10 25 50

← PrevPage 3 of 8Next →

No leaderboard results yet.