SOTAVerified
Scheduling
Project or Job Scheduling
Papers
Showing 21–30 of 3104 papers
| Title | Date | Tasks | Status | Hype |
| --- | --- | --- | --- | --- |
| A Survey on Inference Optimization Techniques for Mixture of Experts Models | Dec 18, 2024 | Computational Efficiency, Distributed Computing | Code Available | 3 |
| Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1 | Oct 3, 2024 | Scheduling | Code Available | 3 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPU, Language Modeling | Code Available | 3 |
| FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering | Aug 15, 2024 | Computational Efficiency, Scheduling | Code Available | 3 |
| Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | Mar 4, 2024 | GPU, Scheduling | Code Available | 3 |
| Fairness in Serving Large Language Models | Dec 31, 2023 | Fairness, Scheduling | Code Available | 3 |
| Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System | Apr 23, 2020 | Scheduling | Code Available | 3 |
| MNN: A Universal and Efficient Inference Engine | Feb 27, 2020 | Deep Learning, Diversity | Code Available | 3 |
| SystolicAttention: Fusing FlashAttention within a Single Systolic Array | Jul 15, 2025 | Scheduling | Code Available | 2 |
| Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Jun 23, 2025 | GPU, Large Language Model | Code Available | 2 |
No leaderboard results yet.