| Unveiling Redundancy in Diffusion Transformers (DiTs): A Systematic Study | Nov 18, 2024 | Scheduling | Code Available | 1 |
| On the Incorporation of Stability Constraints into Sequential Operational Scheduling | Nov 18, 2024 | Decision Making, Scheduling | Unverified | 0 |
| Topology-aware Preemptive Scheduling for Co-located LLM Workloads | Nov 18, 2024 | Language Modeling | Code Available | 0 |
| Leveraging Bitcoin Mining Machines in Demand-Response Mechanisms to Mitigate Ramping-Induced Transients | Nov 17, 2024 | Scheduling | Unverified | 0 |
| Adaptive Learning of Design Strategies over Non-Hierarchical Multi-Fidelity Models via Policy Alignment | Nov 16, 2024 | Reinforcement Learning (RL), Scheduling | Unverified | 0 |
| Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training | Nov 15, 2024 | Image Generation, Scheduling | Unverified | 0 |
| Latency Optimization in LEO Satellite Communications with Hybrid Beam Pattern and Interference Control | Nov 14, 2024 | Graph Generation, Scheduling | Unverified | 0 |
| Robot Tasks with Fuzzy Time Requirements from Natural Language Instructions | Nov 14, 2024 | Scheduling | Unverified | 0 |
| Time-constrained Federated Learning (FL) in Push-Pull IoT Wireless Access | Nov 13, 2024 | Federated Learning, Scheduling | Unverified | 0 |
| Towards Practical Deep Schedulers for Allocating Cellular Radio Resources | Nov 13, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| A Fuzzy Reinforcement LSTM-based Long-term Prediction Model for Fault Conditions in Nuclear Power Plants | Nov 13, 2024 | Anomaly Detection, Decision Making | Unverified | 0 |
| Exploring Multi-Agent Reinforcement Learning for Unrelated Parallel Machine Scheduling | Nov 12, 2024 | Management, Multi-agent Reinforcement Learning | Unverified | 0 |
| Optimizing LLM Inference for Database Systems: Cost-Aware Scheduling for Concurrent Requests | Nov 12, 2024 | Decision Making, GPU | Unverified | 0 |
| Sensing Capacity for Integrated Sensing and Communication Systems in Low-Altitude Economy | Nov 11, 2024 | Integrated sensing and communication, ISAC | Unverified | 0 |
| Co-Scheduling of Energy and Production in Discrete Manufacturing Considering Decision-Dependent Uncertainties | Nov 11, 2024 | Scheduling | Unverified | 0 |
| Scalable Distributed Least Squares Algorithm for Linear Algebraic Equations via Scheduling | Nov 11, 2024 | Scheduling | Unverified | 0 |
| Two-Stage Stochastic Optimization for Low-Carbon Dispatch in a Combined Energy System | Nov 11, 2024 | Scheduling, Stochastic Optimization | Unverified | 0 |
| Eavesdropping on Goal-Oriented Communication: Timing Attacks and Countermeasures | Nov 11, 2024 | Scheduling, Semantic Communication | Unverified | 0 |
| PLM-Based Discrete Diffusion Language Models with Entropy-Adaptive Gibbs Sampling | Nov 10, 2024 | Denoising, Diversity | Unverified | 0 |
| An Energy-Based Self-Adaptive Learning Rate for Stochastic Gradient Descent: Enhancing Unconstrained Optimization with VAV method | Nov 10, 2024 | Scheduling | Unverified | 0 |
| Reinforcement Learning for Adaptive Resource Scheduling in Complex System Environments | Nov 8, 2024 | Cloud Computing, Edge-computing | Unverified | 0 |
| Data-Driven Min-Max MPC for LPV Systems with Unknown Scheduling Signal | Nov 8, 2024 | Model Predictive Control, Scheduling | Unverified | 0 |
| SAUCE: Synchronous and Asynchronous User-Customizable Environment for Multi-Agent LLM Interaction | Nov 5, 2024 | Scheduling | Code Available | 0 |
| AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution | Nov 5, 2024 | Language Modeling | Unverified | 0 |
| P-MOSS: Learned Scheduling For Indexes Over NUMA Servers Using Low-Level Hardware Statistics | Nov 5, 2024 | CPU, Scheduling | Unverified | 0 |
| NEO: Saving GPU Memory Crisis with CPU Offloading for Online LLM Inference | Nov 2, 2024 | Code Generation, CPU | Code Available | 0 |
| Enhancing Adaptive Mixed-Criticality Scheduling with Deep Reinforcement Learning | Nov 1, 2024 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Multi-Agent Deep Q-Network with Layer-based Communication Channel for Autonomous Internal Logistics Vehicle Scheduling in Smart Manufacturing | Nov 1, 2024 | Scheduling | Unverified | 0 |
| DynaSplit: A Hardware-Software Co-Design Framework for Energy-Aware Inference on Edge | Oct 31, 2024 | CPU, Scheduling | Unverified | 0 |
| ALISE: Accelerating Large Language Model Serving with Speculative Scheduling | Oct 31, 2024 | Blocking, Language Modeling | Unverified | 0 |
| EF-LLM: Energy Forecasting LLM with AI-assisted Automation, Enhanced Sparse Prediction, Hallucination Detection | Oct 30, 2024 | Continual Learning, Hallucination | Unverified | 0 |
| Automatic programming via large language models with population self-evolution for dynamic job shop scheduling problem | Oct 30, 2024 | Deep Reinforcement Learning, Evolutionary Algorithms | Unverified | 0 |
| Planning and Learning in Risk-Aware Restless Multi-Arm Bandit Problem | Oct 30, 2024 | Scheduling, Thompson Sampling | Unverified | 0 |
| Bayesian Counterfactual Prediction Models for HIV Care Retention with Incomplete Outcome and Covariate Information | Oct 29, 2024 | Causal Inference, counterfactual | Unverified | 0 |
| How Does Critical Batch Size Scale in Pre-training? | Oct 29, 2024 | Scheduling | Code Available | 1 |
| Carbon-Aware Computing for Data Centers with Probabilistic Performance Guarantees | Oct 28, 2024 | Cloud Computing, Computational Efficiency | Unverified | 0 |
| Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | Oct 28, 2024 | Computational Efficiency, Decision Making | Unverified | 0 |
| SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity | Oct 28, 2024 | Autonomous Driving, object-detection | Unverified | 0 |
| Capacity-Aware Planning and Scheduling in Budget-Constrained Monotonic MDPs: A Meta-RL Approach | Oct 28, 2024 | Industrial Robots, Scheduling | Unverified | 0 |
| Age of Information-Oriented Probabilistic Link Scheduling for Device-to-Device Networks | Oct 26, 2024 | Graph Neural Network, Scheduling | Unverified | 0 |
| Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design | Oct 24, 2024 | Mixture-of-Experts, MMLU | Code Available | 1 |
| Applying Neural Monte Carlo Tree Search to Unsignalized Multi-intersection Scheduling for Autonomous Vehicles | Oct 24, 2024 | Autonomous Vehicles, Scheduling | Unverified | 0 |
| Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes | Oct 23, 2024 | Management, Q-Learning | Unverified | 0 |
| Fast Inference for Augmented Large Language Models | Oct 23, 2024 | Scheduling | Unverified | 0 |
| Exploiting Data Centres and Local Energy Communities Synergies for Market Participation | Oct 23, 2024 | energy management, Management | Unverified | 0 |
| Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs | Oct 23, 2024 | GPU, Scheduling | Unverified | 0 |
| ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference | Oct 23, 2024 | Computational Efficiency, CPU | Unverified | 0 |
| MCUBERT: Memory-Efficient BERT Inference on Commodity Microcontrollers | Oct 23, 2024 | Neural Architecture Search, Scheduling | Unverified | 0 |
| A Surrogate Model for Quay Crane Scheduling Problem | Oct 22, 2024 | model, Scheduling | Unverified | 0 |
| AI-focused HPC Data Centers Can Provide More Power Grid Flexibility and at Lower Cost | Oct 22, 2024 | CPU, GPU | Unverified | 0 |