SOTAVerified

Scheduling

Project or Job Scheduling

Papers

Showing 26-50 of 3104 papers

| Title | Status | Hype |
| --- | --- | --- |
| Taming Throughput-Latency Tradeoff in LLM Inference with Sarathi-Serve | Code | 3 |
| Vine Copulas as Differentiable Computational Graphs | Code | 3 |
| A Survey on Inference Optimization Techniques for Mixture of Experts Models | Code | 3 |
| Piloting Structure-Based Drug Design via Modality-Specific Optimal Schedule | Code | 2 |
| Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Code | 2 |
| MineLand: Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs | Code | 2 |
| NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing | Code | 2 |
| Chat AI: A Seamless Slurm-Native Solution for HPC-Based Services | Code | 2 |
| Learning to Solve Job Shop Scheduling under Uncertainty | Code | 2 |
| MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning | Code | 2 |
| Preble: Efficient Distributed Prompt Scheduling for LLM Serving | Code | 2 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Code | 2 |
| AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World | Code | 2 |
| Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs | Code | 2 |
| FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation | Code | 2 |
| ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering | Code | 2 |
| Human-in-the-Loop Large-Scale Predictive Maintenance of Workstations | Code | 2 |
| Efficient LLM Scheduling by Learning to Rank | Code | 2 |
| ChaCha for Online AutoML | Code | 2 |
| EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting | Code | 2 |
| BMInf: An Efficient Toolkit for Big Model Inference and Tuning | Code | 2 |
| ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning | Code | 2 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | Code | 2 |
| DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent) | Code | 2 |
| AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms | Code | 2 |

No leaderboard results yet.