SOTAVerified

CPU

Papers

Showing 110 of 2231 papers

TitleStatusHype
Magika: AI-Powered Content-Type DetectionCode11
WebLLM: A High-Performance In-Browser LLM Inference EngineCode11
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data ConstructionCode9
PowerInfer-2: Fast Large Language Model Inference on a SmartphoneCode9
Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation ModelsCode9
Full Scaling Automation for Sustainable Development of Green Data CentersCode7
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via TensorizationCode7
Mooncake: A KVCache-centric Disaggregated Architecture for LLM ServingCode7
Chinese-Vicuna: A Chinese Instruction-following Llama-based ModelCode7
Elixir: Train a Large Language Model on a Small GPU ClusterCode7
Show:102550
← PrevPage 1 of 224Next →

No leaderboard results yet.