SOTAVerified

GPU

Papers

Showing 211–220 of 5629 papers

Title | Status | Hype
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Code | 3
Dataset Distillation with Neural Characteristic Function: A Minmax Perspective | Code | 3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Code | 3
Data Generation for Hardware-Friendly Post-Training Quantization | Code | 3
Biomedical and Clinical English Model Packages in the Stanza Python NLP Library | Code | 3
FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale | Code | 3
Cramming: Training a Language Model on a Single GPU in One Day | Code | 3
Allo: A Programming Model for Composable Accelerator Design | Code | 3
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models | Code | 3
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning | Code | 3

No leaderboard results yet.