| Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints | Apr 15, 2025 | GPUInference Optimization | CodeCode Available | 4 |
| Modeling and solving an integrated periodic vehicle routing and capacitated facility location problem in the context of solid waste collection | Apr 14, 2025 | ManagementScheduling | CodeCode Available | 0 |
| SPOT: Spatio-Temporal Pattern Mining and Optimization for Load Consolidation in Freight Transportation Networks | Apr 13, 2025 | Scheduling | —Unverified | 0 |
| AirVista-II: An Agentic System for Embodied UAVs Toward Dynamic Scene Semantic Understanding | Apr 13, 2025 | Disaster ResponseScheduling | —Unverified | 0 |
| An Enhanced Iterative Deepening Search Algorithm for the Unrestricted Container Rehandling Problem | Apr 12, 2025 | Scheduling | —Unverified | 0 |
| InterQ: A DQN Framework for Optimal Intermittent Control | Apr 12, 2025 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| SpecEE: Accelerating Large Language Model Inference with Speculative Early Exiting | Apr 11, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPULarge Language Model | CodeCode Available | 1 |
| Bottleneck Identification in Resource-Constrained Project Scheduling via Constraint Relaxation | Apr 10, 2025 | Decision MakingScheduling | —Unverified | 0 |
| Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems | Apr 10, 2025 | Reinforcement Learning (RL)Scheduling | —Unverified | 0 |
| Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents | Apr 10, 2025 | AI AgentLarge Language Model | —Unverified | 0 |
| Learning Joint Source-Channel Encoding in IRS-assisted Multi-User Semantic Communications | Apr 10, 2025 | Deep Reinforcement LearningScheduling | —Unverified | 0 |
| Probability Estimation and Scheduling Optimization for Battery Swap Stations via LRU-Enhanced Genetic Algorithm and Dual-Factor Decision System | Apr 10, 2025 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| xApp Conflict Mitigation with Scheduler | Apr 9, 2025 | Scheduling | —Unverified | 0 |
| NAPER: Fault Protection for Real-Time Resource-Constrained Deep Neural Networks | Apr 9, 2025 | Ensemble LearningFault Detection | —Unverified | 0 |
| Accelerating LLM Inference Throughput via Asynchronous KV Cache Prefetching | Apr 8, 2025 | GPUScheduling | —Unverified | 0 |
| SkillFlow: Efficient Skill and Code Transfer Through Communication in Adapting AI Agents | Apr 8, 2025 | Scheduling | —Unverified | 0 |
| HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Apr 8, 2025 | CPUGPU | CodeCode Available | 2 |
| Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models | Apr 7, 2025 | Question AnsweringScheduling | —Unverified | 0 |
| DyTTP: Trajectory Prediction with Normalization-Free Transformers | Apr 7, 2025 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| A Constraint Programming Model For Serial Batch Scheduling With Minimum Batch Size | Apr 7, 2025 | Scheduling | —Unverified | 0 |
| TRATSS: Transformer-Based Task Scheduling System for Autonomous Vehicles | Apr 7, 2025 | Autonomous VehiclesScheduling | —Unverified | 0 |
| L3GS: Layered 3D Gaussian Splats for Efficient 3D Scene Delivery | Apr 7, 2025 | 3DGSScheduling | CodeCode Available | 0 |
| Age-of-information minimization under energy harvesting and non-stationary environment | Apr 7, 2025 | Scheduling | —Unverified | 0 |
| Opportunistic Beamforming and Dynamic Scheduling for Multi-User MIMO-ISAC Systems | Apr 6, 2025 | ISACScheduling | —Unverified | 0 |
| Improving Mixed-Criticality Scheduling with Reinforcement Learning | Apr 4, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Accurate GPU Memory Prediction for Deep Learning Jobs through Dynamic Analysis | Apr 4, 2025 | CPUGPU | —Unverified | 0 |
| Performance-Aware Control of Modular Batteries For Fast Frequency Response | Apr 4, 2025 | Scheduling | —Unverified | 0 |
| FlowKV: A Disaggregated Inference Framework with Low-Latency KV Cache Transfer and Load-Aware Scheduling | Apr 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Secrecy Performance of a Keyhole-based Multi-user System with Multiple Eavesdroppers | Apr 3, 2025 | Scheduling | —Unverified | 0 |
| Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research | Apr 3, 2025 | ManagementReinforcement Learning (RL) | —Unverified | 0 |
| Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions | Apr 2, 2025 | Scheduling | CodeCode Available | 0 |
| Personality-Driven Decision-Making in LLM-Based Autonomous Agents | Apr 1, 2025 | Decision MakingScheduling | —Unverified | 0 |
| Accelerating drug discovery with Artificial: a whole-lab orchestration and scheduling system for self-driving labs | Apr 1, 2025 | Decision MakingDrug Discovery | —Unverified | 0 |
| Optimizing Age of Information in Networks with Large and Small Updates | Mar 31, 2025 | Scheduling | —Unverified | 0 |
| AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World | Mar 31, 2025 | Robot ManipulationScheduling | CodeCode Available | 2 |
| TransMamba: Flexibly Switching between Transformer and Mamba | Mar 31, 2025 | MambaScheduling | —Unverified | 0 |
| Dynamic Operating System Scheduling Using Double DQN: A Reinforcement Learning Approach to Task Optimization | Mar 31, 2025 | Cloud ComputingScheduling | —Unverified | 0 |
| Machine Learning-assisted High-speed Combinatorial Optimization with Ising Machines for Dynamically Changing Problems | Mar 31, 2025 | Combinatorial OptimizationScheduling | —Unverified | 0 |
| A Hybrid Reinforcement Learning Framework for Hard Latency Constrained Resource Scheduling | Mar 30, 2025 | Scheduling | —Unverified | 0 |
| Quantum Generative Models for Image Generation: Insights from MNIST and MedMNIST | Mar 30, 2025 | Image GenerationScheduling | —Unverified | 0 |
| PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference | Mar 29, 2025 | GPUScheduling | —Unverified | 0 |
| Niyama : Breaking the Silos of LLM Inference Serving | Mar 28, 2025 | ChunkingFairness | —Unverified | 0 |
| Dual-Splitting Conformal Prediction for Multi-Step Time Series Forecasting | Mar 27, 2025 | Conformal PredictionLoad Forecasting | —Unverified | 0 |
| How do language models learn facts? Dynamics, curricula and hallucinations | Mar 27, 2025 | Scheduling | —Unverified | 0 |
| Optimizing Multi-DNN Inference on Mobile Devices through Heterogeneous Processor Co-Execution | Mar 27, 2025 | Scheduling | —Unverified | 0 |
| Exploration of Multi-Element Collaborative Research and Application for Modern Power System Based on Generative Large Models | Mar 26, 2025 | ManagementScheduling | —Unverified | 0 |
| β-GNN: A Robust Ensemble Approach Against Graph Structure Perturbation | Mar 26, 2025 | Anomaly DetectionManagement | CodeCode Available | 0 |
| Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation | Mar 26, 2025 | Large Language ModelScheduling | CodeCode Available | 1 |
| Capacity-Constrained Online Learning with Delays: Scheduling Frameworks and Regret Trade-offs | Mar 25, 2025 | Scheduling | —Unverified | 0 |