SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Scheduling
Scheduling
Project or Job Scheduling
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 1–10 of 3104 papers
Title
Date
Tasks
Status
Hype
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving
Jan 2, 2025
GPU
Scheduling
Code
Code Available
9
PowerInfer-2: Fast Large Language Model Inference on a Smartphone
Jun 10, 2024
CPU
Language Modeling
Code
Code Available
9
Steering Language Models with Game-Theoretic Solvers
Jan 24, 2024
Imitation Learning
Scheduling
Code
Code Available
9
FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving
Nov 27, 2024
Fairness
GPU
Code
Code Available
7
The Road Less Scheduled
May 24, 2024
Scheduling
Code
Code Available
7
Colossal-Auto: Unified Automation of Parallelization and Activation Checkpoint for Large-scale Models
Feb 6, 2023
Scheduling
Code
Code Available
7
AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance
Jun 4, 2025
Benchmarking
Scheduling
Code
Code Available
5
FlowTok: Flowing Seamlessly Across Text and Image Tokens
Mar 13, 2025
Denoising
Image to text
Code
Code Available
5
MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
Aug 21, 2024
GPU
Quantization
Code
Code Available
5
Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints
Apr 15, 2025
GPU
Inference Optimization
Code
Code Available
4
Show:
10
25
50
← Prev
Page 1 of 311
Next →
No leaderboard results yet.