Safe Reinforcement Learning for Real-World Engine Control Jan 28, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care Jan 28, 2025 Reinforcement Learning (RL)
— Unverified 0Heterogeneity-aware Personalized Federated Learning via Adaptive Dual-Agent Reinforcement Learning Jan 28, 2025 Federated Learning Knowledge Distillation
— Unverified 0Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies Jan 28, 2025 Reinforcement Learning (RL)
— Unverified 0Exploratory Mean-Variance Portfolio Optimization with Regime-Switching Market Dynamics Jan 28, 2025 Portfolio Optimization Reinforcement Learning (RL)
— Unverified 0Improving Vision-Language-Action Model with Online Reinforcement Learning Jan 28, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0MPC4RL -- A Software Package for Reinforcement Learning based on Model Predictive Control Jan 27, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0FuzzyLight: A Robust Two-Stage Fuzzy Approach for Traffic Signal Control Works in Real Cities Jan 27, 2025 compressed sensing Reinforcement Learning (RL)
— Unverified 0Towards General-Purpose Model-Free Reinforcement Learning Jan 27, 2025 model reinforcement-learning
— Unverified 0Selective Experience Sharing in Reinforcement Learning Enhances Interference Management Jan 27, 2025 Management Multi-agent Reinforcement Learning
— Unverified 0Benchmarking Quantum Reinforcement Learning Jan 27, 2025 Benchmarking reinforcement-learning
Code Code Available 0Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback Jan 27, 2025 Offline RL Reinforcement Learning (RL)
— Unverified 0Learning-Enhanced Safeguard Control for High-Relative-Degree Systems: Robust Optimization under Disturbances and Faults Jan 26, 2025 Reinforcement Learning (RL) Safe Exploration
— Unverified 0Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning Jan 26, 2025 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Data Center Cooling System Optimization Using Offline Reinforcement Learning Jan 25, 2025 Graph Neural Network Offline RL
— Unverified 0Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction Jan 25, 2025 Decoder Machine Translation
— Unverified 0Towards Efficient Multi-Objective Optimisation for Real-World Power Grid Topology Control Jan 24, 2025 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework Jan 24, 2025 Q-Learning Reinforcement Learning (RL)
— Unverified 0Age and Power Minimization via Meta-Deep Reinforcement Learning in UAV Networks Jan 24, 2025 Deep Reinforcement Learning Meta-Learning
— Unverified 0Large Language Model driven Policy Exploration for Recommender Systems Jan 23, 2025 Language Modeling Language Modelling
— Unverified 0On Generalization and Distributional Update for Mimicking Observations with Adequate Exploration Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0Evolution and The Knightian Blindspot of Machine Learning Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0Adaptive Data Exploitation in Deep Reinforcement Learning Jan 22, 2025 Computational Efficiency Deep Reinforcement Learning
Code Code Available 0State Combinatorial Generalization In Decision Making With Conditional Diffusion Models Jan 22, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Deep Reinforcement Learning with Hybrid Intrinsic Reward Model Jan 22, 2025 Deep Reinforcement Learning Diversity
— Unverified 0Reinforcement learning Based Automated Design of Differential Evolution Algorithm for Black-box Optimization Jan 22, 2025 Evolutionary Algorithms Meta-Learning
— Unverified 0Exploring the Technology Landscape through Topic Modeling, Expert Involvement, and Reinforcement Learning Jan 22, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0MONA: Myopic Optimization with Non-myopic Approval Can Mitigate Multi-step Reward Hacking Jan 22, 2025 Reinforcement Learning (RL)
— Unverified 0To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning Jan 22, 2025 Management Reinforcement Learning (RL)
Code Code Available 0AdaWM: Adaptive World Model based Planning for Autonomous Driving Jan 22, 2025 Autonomous Driving Model-based Reinforcement Learning
— Unverified 0Reinforcement Learning Constrained Beam Search for Parameter Optimization of Paper Drying Under Flexible Constraints Jan 21, 2025 Combinatorial Optimization reinforcement-learning
— Unverified 0Extend Adversarial Policy Against Neural Machine Translation via Unknown Token Jan 21, 2025 Machine Translation NMT
— Unverified 0RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression Jan 21, 2025 Autonomous Driving Object Recognition
— Unverified 0RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? Jan 20, 2025 Math Reinforcement Learning (RL)
— Unverified 0Improving thermal state preparation of Sachdev-Ye-Kitaev model with reinforcement learning on quantum hardware Jan 20, 2025 Reinforcement Learning (RL)
Code Code Available 0GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation Jan 19, 2025 Bug fixing Code Completion
Code Code Available 0Solving Finite-Horizon MDPs via Low-Rank Tensors Jan 17, 2025 Reinforcement Learning (RL)
— Unverified 0RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection Jan 16, 2025 Autonomous Driving Object
— Unverified 0Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Jan 16, 2025 Reinforcement Learning (RL)
— Unverified 0PixelBrax: Learning Continuous Control from Pixels End-to-End on the GPU Jan 16, 2025 Benchmarking continuous-control
Code Code Available 0From Explainability to Interpretability: Interpretable Policies in Reinforcement Learning Via Model Explanation Jan 16, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0Average-Reward Reinforcement Learning with Entropy Regularization Jan 15, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning-Enhanced Procedural Generation for Dynamic Narrative-Driven AR Experiences Jan 15, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning Jan 15, 2025 D4RL Q-Learning
— Unverified 0Hybrid Action Based Reinforcement Learning for Multi-Objective Compatible Autonomous Driving Jan 14, 2025 Attribute Autonomous Driving
— Unverified 0Decision Transformers for RIS-Assisted Systems with Diffusion Model-Based Channel Acquisition Jan 14, 2025 Denoising Imputation
— Unverified 0CHEQ-ing the Box: Safe Variable Impedance Learning for Robotic Polishing Jan 14, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0FDPP: Fine-tune Diffusion Policy with Human Preference Jan 14, 2025 Imitation Learning Reinforcement Learning (RL)
— Unverified 0Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data Jan 13, 2025 Imitation Learning MuJoCo
Code Code Available 0Combining LLM decision and RL action selection to improve RL policy for adaptive interventions Jan 13, 2025 Reinforcement Learning (RL)
— Unverified 0