SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Scheduling
Scheduling
Project or Job Scheduling
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 21–30 of 3104 papers
Title
Date
Tasks
Status
Hype
A Survey on Large Language Model Acceleration based on KV Cache Management
Dec 27, 2024
Language Modeling
Language Modelling
Code
Code Available
3
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1
Oct 3, 2024
Scheduling
Code
Code Available
3
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System
Apr 23, 2020
Scheduling
Code
Code Available
3
MNN: A Universal and Efficient Inference Engine
Feb 27, 2020
Deep Learning
Diversity
Code
Code Available
3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management
Oct 1, 2024
GPU
Language Modeling
Code
Code Available
3
Efficiently Serving LLM Reasoning Programs with Certaindex
Dec 30, 2024
Code Generation
Mathematical Problem-Solving
Code
Code Available
3
Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve
Mar 4, 2024
GPU
Scheduling
Code
Code Available
3
A Survey on Inference Optimization Techniques for Mixture of Experts Models
Dec 18, 2024
Computational Efficiency
Distributed Computing
Code
Code Available
3
mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs
Dec 5, 2023
GPU
Large Language Model
Code
Code Available
2
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning
Dec 11, 2021
Deep Reinforcement Learning
GPU
Code
Code Available
2
Show:
10
25
50
← Prev
Page 3 of 311
Next →
No leaderboard results yet.