SOTAVerified

GPU

Papers

Showing 221-230 of 5629 papers

Title | Status | Hype
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning | Code | 3
TorchCP: A Python Library for Conformal Prediction | Code | 3
BitDelta: Your Fine-Tune May Only Be Worth One Bit | Code | 3
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Code | 3
EscherNet: A Generative Model for Scalable View Synthesis | Code | 3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Code | 3
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces | Code | 3
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | Code | 3
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design | Code | 3
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache | Code | 3

No leaderboard results yet.