SOTAVerified

CPU

Papers

Showing 1–25 of 2231 papers

Title | Status | Hype
WebLLM: A High-Performance In-Browser LLM Inference Engine | Code | 11
Magika: AI-Powered Content-Type Detection | Code | 11
PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction | Code | 9
Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for and with Foundation Models | Code | 9
PowerInfer-2: Fast Large Language Model Inference on a Smartphone | Code | 9
Chinese-Vicuna: A Chinese Instruction-following Llama-based Model | Code | 7
Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization | Code | 7
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Code | 7
Full Scaling Automation for Sustainable Development of Green Data Centers | Code | 7
Elixir: Train a Large Language Model on a Small GPU Cluster | Code | 7
Fast On-device LLM Inference with NPUs | Code | 5
XFeat: Accelerated Features for Lightweight Image Matching | Code | 5
Extreme Compression of Large Language Models via Additive Quantization | Code | 5
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU | Code | 5
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications | Code | 5
FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Code | 5
Vectorized and performance-portable Quicksort | Code | 5
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float | Code | 4
SocialED: A Python Library for Social Event Detection | Code | 4
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems | Code | 4
Data-Prep-Kit: getting your data ready for LLM application development | Code | 4
SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning | Code | 4
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge | Code | 4
Look Once to Hear: Target Speech Hearing with Noisy Examples | Code | 4
Vidur: A Large-Scale Simulation Framework For LLM Inference | Code | 4

No leaderboard results yet.