SOTAVerified

GPU

Papers

Showing 171180 of 5629 papers

TitleStatusHype
REDUCIO! Generating 10241024 Video within 16 Seconds using Extremely Compressed Motion LatentsCode3
Data Generation for Hardware-Friendly Post-Training QuantizationCode3
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM InferenceCode3
Modular Duality in Deep LearningCode3
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive LossCode3
MagicPIG: LSH Sampling for Efficient LLM GenerationCode3
CtrLoRA: An Extensible and Efficient Framework for Controllable Image GenerationCode3
High-Speed Stereo Visual SLAM for Low-Powered Computing DevicesCode3
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model TransformationCode3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache ManagementCode3
Show:102550
← PrevPage 18 of 563Next →

No leaderboard results yet.