SOTAVerified|Agents Browse Leaderboard About Blog

Scheduling

Project or Job Scheduling

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 3104 papers

Title	Date	Tasks	Status	Hype
A Survey on Large Language Model Acceleration based on KV Cache Management	Dec 27, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1	Oct 3, 2024	Scheduling	CodeCode Available	3
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System	Apr 23, 2020	Scheduling	CodeCode Available	3
MNN: A Universal and Efficient Inference Engine	Feb 27, 2020	Deep LearningDiversity	CodeCode Available	3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management	Oct 1, 2024	GPULanguage Modeling	CodeCode Available	3
Efficiently Serving LLM Reasoning Programs with Certaindex	Dec 30, 2024	Code GenerationMathematical Problem-Solving	CodeCode Available	3
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve	Mar 4, 2024	GPUScheduling	CodeCode Available	3
A Survey on Inference Optimization Techniques for Mixture of Experts Models	Dec 18, 2024	Computational EfficiencyDistributed Computing	CodeCode Available	3
mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs	Dec 5, 2023	GPULarge Language Model	CodeCode Available	2
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning	Dec 11, 2021	Deep Reinforcement LearningGPU	CodeCode Available	2

Show:10 25 50

← PrevPage 3 of 311Next →

No leaderboard results yet.