SOTAVerified

GPU

Papers

Showing 441–450 of 5629 papers

| Title | Status | Hype |
|---|---|---|
| H₂O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Code | 2 |
| Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference | Code | 2 |
| Accelerating Sparse Deep Neural Networks | Code | 2 |
| AutoFocus: Efficient Multi-Scale Inference | Code | 2 |
| Deep Snake for Real-Time Instance Segmentation | Code | 2 |
| PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning | Code | 2 |
| Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | Code | 2 |
| HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Code | 2 |
| HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Code | 2 |
| GPU Performance Portability needs Autotuning | Code | 2 |

No leaderboard results yet.