SOTAVerified

Scheduling

Project or Job Scheduling

Papers

Showing 51100 of 3104 papers

TitleStatusHype
Preble: Efficient Distributed Prompt Scheduling for LLM ServingCode2
WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace SettingCode2
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource AllocationCode2
MineLand: Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical NeedsCode2
Characterization of Large Language Model Development in the DatacenterCode2
Learning to Solve Job Shop Scheduling under UncertaintyCode2
DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent)Code2
DPoser: Diffusion Model as Robust 3D Human Pose PriorCode2
mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUsCode2
Zero Bubble Pipeline ParallelismCode2
SkiROS2: A skill-based Robot Control Platform for ROSCode2
evosax: JAX-based Evolution StrategiesCode2
Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor ProgramsCode2
Human-in-the-Loop Large-Scale Predictive Maintenance of WorkstationsCode2
BMInf: An Efficient Toolkit for Big Model Inference and TuningCode2
TGL: A General Framework for Temporal GNN Training on Billion-Scale GraphsCode2
ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement LearningCode2
ChaCha for Online AutoMLCode2
ConsumerBench: Benchmarking Generative AI Applications on End-User DevicesCode1
All is Not Lost: LLM Recovery without CheckpointsCode1
A Production Scheduling Framework for Reinforcement Learning Under Real-World ConstraintsCode1
Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia GamesCode1
Diagonal Batching Unlocks Parallelism in Recurrent Memory Transformers for Long ContextsCode1
Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent VisibilityCode1
Decoupling Spatio-Temporal Prediction: When Lightweight Large Models Meet Adaptive HypergraphsCode1
Task Memory Engine: Spatial Memory for Robust Multi-Step LLM AgentsCode1
Structured Reinforcement Learning for Combinatorial Decision-MakingCode1
Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMsCode1
GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution StrategyCode1
FastCar: Cache Attentive Replay for Fast Auto-Regressive Video Generation on the EdgeCode1
Taming the Titans: A Survey of Efficient LLM Inference ServingCode1
Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference ServingCode1
Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention DisaggregationCode1
Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-OptimizationCode1
SkyLadder: Better and Faster Pretraining via Context Window SchedulingCode1
SkipPipe: Partial and Reordered Pipelining Framework for Training LLMs in Heterogeneous NetworksCode1
Starjob: Dataset for LLM-Driven Job Shop SchedulingCode1
Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop SchedulingCode1
The Surprising Agreement Between Convex Optimization Theory and Learning-Rate Scheduling for Large Model TrainingCode1
An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman ProblemCode1
CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement LearningCode1
Dynamics-incorporated Modeling Framework for Stability Constrained Scheduling Under High-penetration of Renewable EnergyCode1
Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge NetworksCode1
Brain-to-Text Benchmark '24: Lessons LearnedCode1
Multi Agent Reinforcement Learning for Sequential Satellite Assignment ProblemsCode1
Neural Combinatorial Optimization for Stochastic Flexible Job Shop Scheduling ProblemsCode1
Grid: Omni Visual GenerationCode1
From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial InjectionCode1
Digital Transformation in the Water Distribution System based on the Digital Twins ConceptCode1
Robust Planning with Compound LLM Architectures: An LLM-Modulo ApproachCode1
Show:102550
← PrevPage 2 of 63Next →

No leaderboard results yet.