SOTAVerified

Scheduling

Project or Job Scheduling

Papers

Showing 201–250 of 3104 papers

| Title | Status | Hype |
| --- | --- | --- |
| Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Code | 4 |
| Modeling and solving an integrated periodic vehicle routing and capacitated facility location problem in the context of solid waste collection | Code | 0 |
| SPOT: Spatio-Temporal Pattern Mining and Optimization for Load Consolidation in Freight Transportation Networks | | 0 |
| AirVista-II: An Agentic System for Embodied UAVs Toward Dynamic Scene Semantic Understanding | | 0 |
| An Enhanced Iterative Deepening Search Algorithm for the Unrestricted Container Rehandling Problem | | 0 |
| InterQ: A DQN Framework for Optimal Intermittent Control | Code | 0 |
| SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting | | 0 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Code | 1 |
| Bottleneck Identification in Resource-Constrained Project Scheduling via Constraint Relaxation | | 0 |
| Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems | | 0 |
| Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents | | 0 |
| Learning Joint Source-Channel Encoding in IRS-assisted Multi-User Semantic Communications | | 0 |
| Probability Estimation and Scheduling Optimization for Battery Swap Stations via LRU-Enhanced Genetic Algorithm and Dual-Factor Decision System | Code | 0 |
| xApp Conflict Mitigation with Scheduler | | 0 |
| NAPER: Fault Protection for Real-Time Resource-Constrained Deep Neural Networks | | 0 |
| Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching | | 0 |
| SkillFlow: Efficient Skill and Code Transfer Through Communication in Adapting AI Agents | | 0 |
| HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Code | 2 |
| Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models | | 0 |
| DyTTP: Trajectory Prediction with Normalization-Free Transformers | | 0 |
| A Constraint Programming Model For Serial Batch Scheduling With Minimum Batch Size | | 0 |
| TRATSS: Transformer-Based Task Scheduling System for Autonomous Vehicles | | 0 |
| L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery | Code | 0 |
| Age-of-information minimization under energy harvesting and non-stationary environment | | 0 |
| Opportunistic Beamforming and Dynamic Scheduling for Multi-User MIMO-ISAC Systems | | 0 |
| Improving Mixed-Criticality Scheduling with Reinforcement Learning | | 0 |
| Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis | | 0 |
| Performance-Aware Control of Modular Batteries For Fast Frequency Response | | 0 |
| FlowKV: A Disaggregated Inference Framework with Low-Latency KV Cache Transfer and Load-Aware Scheduling | | 0 |
| Secrecy Performance of a Keyhole-based Multi-user System with Multiple Eavesdroppers | | 0 |
| Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research | | 0 |
| Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions | Code | 0 |
| Personality-Driven Decision-Making in LLM-Based Autonomous Agents | | 0 |
| Accelerating drug discovery with Artificial: a whole-lab orchestration and scheduling system for self-driving labs | | 0 |
| Optimizing Age of Information in Networks with Large and Small Updates | | 0 |
| AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World | Code | 2 |
| TransMamba: Flexibly Switching between Transformer and Mamba | | 0 |
| Dynamic Operating System Scheduling Using Double DQN: A Reinforcement Learning Approach to Task Optimization | | 0 |
| Machine Learning-assisted High-speed Combinatorial Optimization with Ising Machines for Dynamically Changing Problems | | 0 |
| A Hybrid Reinforcement Learning Framework for Hard Latency Constrained Resource Scheduling | | 0 |
| Quantum Generative Models for Image Generation: Insights from MNIST and MedMNIST | | 0 |
| PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference | | 0 |
| Niyama : Breaking the Silos of LLM Inference Serving | | 0 |
| Dual-Splitting Conformal Prediction for Multi-Step Time Series Forecasting | | 0 |
| How do language models learn facts? Dynamics, curricula and hallucinations | | 0 |
| Optimizing Multi-DNN Inference on Mobile Devices through Heterogeneous Processor Co-Execution | | 0 |
| Exploration of Multi-Element Collaborative Research and Application for Modern Power System Based on Generative Large Models | | 0 |
| β-GNN: A Robust Ensemble Approach Against Graph Structure Perturbation | Code | 0 |
| Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation | Code | 1 |
| Capacity-Constrained Online Learning with Delays: Scheduling Frameworks and Regret Trade-offs | | 0 |
Page 5 of 63

No leaderboard results yet.