SOTAVerified

GPU

Papers

Showing 241250 of 5629 papers

TitleStatusHype
The Mamba in the Llama: Distilling and Accelerating Hybrid ModelsCode3
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache QuantizationCode3
Cramming: Training a Language Model on a Single GPU in One DayCode3
Dataset Distillation with Neural Characteristic Function: A Minmax PerspectiveCode3
TorchCP: A Python Library for Conformal PredictionCode3
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers UpCode3
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image DeblurringCode3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache ManagementCode3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?Code3
LinFusion: 1 GPU, 1 Minute, 16K ImageCode3
Show:102550
← PrevPage 25 of 563Next →

No leaderboard results yet.