SOTAVerified

GPU

Papers

Showing 231240 of 5629 papers

TitleStatusHype
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache QuantizationCode3
Cramming: Training a Language Model on a Single GPU in One DayCode3
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement LearningCode3
InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentationCode3
Allo: A Programming Model for Composable Accelerator DesignCode3
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEACode3
Data Generation for Hardware-Friendly Post-Training QuantizationCode3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache ManagementCode3
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationCode3
Consistency Models Made EasyCode3
Show:102550
← PrevPage 24 of 563Next →

No leaderboard results yet.