| StreamRL: Scalable, Heterogeneous, and Elastic RL for LLMs with Disaggregated Stream Generation | Apr 22, 2025 | Reinforcement Learning (RL)Scheduling | —Unverified | 0 |
| Is Intelligence the Right Direction in New OS Scheduling for Multiple Resources in Cloud Environments? | Apr 21, 2025 | SchedulingTransfer Learning | —Unverified | 0 |
| PLANET: A Collection of Benchmarks for Evaluating LLMs' Planning Capabilities | Apr 21, 2025 | Scheduling | —Unverified | 0 |
| Splitwiser: Efficient LM inference with constrained resources | Apr 21, 2025 | GPUScheduling | CodeCode Available | 0 |
| LithOS: An Operating System for Efficient Machine Learning on GPUs | Apr 21, 2025 | BlockingGPU | —Unverified | 0 |
| Fuzzy Logic -- Based Scheduling System for Part-Time Workforce | Apr 21, 2025 | Scheduling | —Unverified | 0 |
| Symmetry-Preserving Architecture for Multi-NUMA Environments (SPANE): A Deep Reinforcement Learning Approach for Dynamic VM Scheduling | Apr 21, 2025 | Cloud ComputingDeep Reinforcement Learning | —Unverified | 0 |
| Sensor Scheduling in Intrusion Detection Games with Uncertain Payoffs | Apr 20, 2025 | Intrusion DetectionScheduling | —Unverified | 0 |
| LLM-Enabled In-Context Learning for Data Collection Scheduling in UAV-assisted Sensor Networks | Apr 20, 2025 | Deep Reinforcement LearningIn-Context Learning | —Unverified | 0 |
| PipeWeaver: Addressing Data Dynamicity in Large Multimodal Model Training with Dynamic Interleaved Pipeline | Apr 19, 2025 | DiversityScheduling | —Unverified | 0 |
| Optimal Scheduling of Dynamic Transport | Apr 19, 2025 | Scheduling | —Unverified | 0 |
| Entropic Time Schedulers for Generative Diffusion Models | Apr 18, 2025 | Scheduling | —Unverified | 0 |
| High-Throughput LLM inference on Heterogeneous Clusters | Apr 18, 2025 | Large Language ModelScheduling | —Unverified | 0 |
| PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting | Apr 18, 2025 | energy managementLanguage Modeling | —Unverified | 0 |
| D^2MoE: Dual Routing and Dynamic Scheduling for Efficient On-Device MoE-based LLM Serving | Apr 17, 2025 | Mixture-of-ExpertsModel Compression | —Unverified | 0 |
| NNTile: a machine learning framework capable of training extremely large GPT language models on a single node | Apr 17, 2025 | CPUGPU | —Unverified | 0 |
| Battery-aware Cyclic Scheduling in Energy-harvesting Federated Learning | Apr 16, 2025 | Federated LearningScheduling | —Unverified | 0 |
| Predictive Multiplicity in Survival Models: A Method for Quantifying Model Uncertainty in Predictive Maintenance Applications | Apr 16, 2025 | SchedulingSurvival Analysis | —Unverified | 0 |
| PGU-SGP: A Pheno-Geno Unified Surrogate Genetic Programming For Real-life Container Terminal Truck Scheduling | Apr 15, 2025 | Combinatorial OptimizationScheduling | —Unverified | 0 |
| Efficient Distributed Retrieval-Augmented Generation for Enhancing Language Model Performance | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Greedy Restart Schedules: A Baseline for Dynamic Algorithm Selection on Numerical Black-box Optimization Problems | Apr 15, 2025 | Scheduling | CodeCode Available | 0 |
| Modeling and solving an integrated periodic vehicle routing and capacitated facility location problem in the context of solid waste collection | Apr 14, 2025 | ManagementScheduling | CodeCode Available | 0 |
| AirVista-II: An Agentic System for Embodied UAVs Toward Dynamic Scene Semantic Understanding | Apr 13, 2025 | Disaster ResponseScheduling | —Unverified | 0 |
| SPOT: Spatio-Temporal Pattern Mining and Optimization for Load Consolidation in Freight Transportation Networks | Apr 13, 2025 | Scheduling | —Unverified | 0 |
| InterQ: A DQN Framework for Optimal Intermittent Control | Apr 12, 2025 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| An Enhanced Iterative Deepening Search Algorithm for the Unrestricted Container Rehandling Problem | Apr 12, 2025 | Scheduling | —Unverified | 0 |
| SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting | Apr 11, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Probability Estimation and Scheduling Optimization for Battery Swap Stations via LRU-Enhanced Genetic Algorithm and Dual-Factor Decision System | Apr 10, 2025 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Learning Joint Source-Channel Encoding in IRS-assisted Multi-User Semantic Communications | Apr 10, 2025 | Deep Reinforcement LearningScheduling | —Unverified | 0 |
| Bottleneck Identification in Resource-Constrained Project Scheduling via Constraint Relaxation | Apr 10, 2025 | Decision MakingScheduling | —Unverified | 0 |
| Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems | Apr 10, 2025 | Reinforcement Learning (RL)Scheduling | —Unverified | 0 |
| Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents | Apr 10, 2025 | AI AgentLarge Language Model | —Unverified | 0 |
| NAPER: Fault Protection for Real-Time Resource-Constrained Deep Neural Networks | Apr 9, 2025 | Ensemble LearningFault Detection | —Unverified | 0 |
| xApp Conflict Mitigation with Scheduler | Apr 9, 2025 | Scheduling | —Unverified | 0 |
| SkillFlow: Efficient Skill and Code Transfer Through Communication in Adapting AI Agents | Apr 8, 2025 | Scheduling | —Unverified | 0 |
| Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching | Apr 8, 2025 | GPUScheduling | —Unverified | 0 |
| A Constraint Programming Model For Serial Batch Scheduling With Minimum Batch Size | Apr 7, 2025 | Scheduling | —Unverified | 0 |
| Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models | Apr 7, 2025 | Question AnsweringScheduling | —Unverified | 0 |
| L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery | Apr 7, 2025 | 3DGSScheduling | CodeCode Available | 0 |
| Age-of-information minimization under energy harvesting and non-stationary environment | Apr 7, 2025 | Scheduling | —Unverified | 0 |
| DyTTP: Trajectory Prediction with Normalization-Free Transformers | Apr 7, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| TRATSS: Transformer-Based Task Scheduling System for Autonomous Vehicles | Apr 7, 2025 | Autonomous VehiclesScheduling | —Unverified | 0 |
| Opportunistic Beamforming and Dynamic Scheduling for Multi-User MIMO-ISAC Systems | Apr 6, 2025 | ISACScheduling | —Unverified | 0 |
| Improving Mixed-Criticality Scheduling with Reinforcement Learning | Apr 4, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis | Apr 4, 2025 | CPUGPU | —Unverified | 0 |
| Performance-Aware Control of Modular Batteries For Fast Frequency Response | Apr 4, 2025 | Scheduling | —Unverified | 0 |
| FlowKV: A Disaggregated Inference Framework with Low-Latency KV Cache Transfer and Load-Aware Scheduling | Apr 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research | Apr 3, 2025 | ManagementReinforcement Learning (RL) | —Unverified | 0 |
| Secrecy Performance of a Keyhole-based Multi-user System with Multiple Eavesdroppers | Apr 3, 2025 | Scheduling | —Unverified | 0 |
| Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions | Apr 2, 2025 | Scheduling | CodeCode Available | 0 |