Comprehensive Review on the Control of Heat Pumps for Energy Flexibility in Distribution Networks Feb 19, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0 Optimizing Gene-Based Testing for Antibiotic Resistance Prediction Feb 19, 2025 Diagnostic Prediction
— Unverified 0 Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage Planning Feb 18, 2025 Combinatorial Optimization Deep Reinforcement Learning
Code Available 0 LocalEscaper: A Weakly-supervised Framework with Regional Reconstruction for Scalable Neural TSP Solvers Feb 18, 2025 Reinforcement Learning (RL) Traveling Salesman Problem
— Unverified 0 Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks Feb 18, 2025 Imitation Learning Minecraft
Code Available 0 Demystifying Multilingual Chain-of-Thought in Process Reward Modeling Feb 18, 2025 Reinforcement Learning (RL)
— Unverified 0 A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models Feb 18, 2025 Deep Reinforcement Learning Recommendation Systems
— Unverified 0 RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Feb 18, 2025 3DGS Autonomous Driving
— Unverified 0 EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning Feb 18, 2025 Navigate Reinforcement Learning (RL)
— Unverified 0 Scaling Test-Time Compute Without Verification or RL is Suboptimal Feb 17, 2025 Math Reinforcement Learning (RL)
— Unverified 0 FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Feb 17, 2025 Decision Making parameter-efficient fine-tuning
— Unverified 0 Robot Deformable Object Manipulation via NMPC-generated Demonstrations in Deep Reinforcement Learning Feb 17, 2025 Deep Reinforcement Learning Deformable Object Manipulation
— Unverified 0 Learning Plasma Dynamics and Robust Rampdown Trajectories with Predict-First Experiments at TCV Feb 17, 2025 Reinforcement Learning (RL)
— Unverified 0 Hovering Flight of Soft-Actuated Insect-Scale Micro Aerial Vehicles using Deep Reinforcement Learning Feb 17, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0 Intersectional Fairness in Reinforcement Learning with Large State and Constraint Spaces Feb 17, 2025 Fairness Reinforcement Learning (RL)
— Unverified 0 VLP: Vision-Language Preference Learning for Embodied Manipulation Feb 17, 2025 Reinforcement Learning (RL)
— Unverified 0 FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control Feb 17, 2025 Imitation Learning reinforcement-learning
— Unverified 0 CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning Feb 17, 2025 MuJoCo Reinforcement Learning (RL)
— Unverified 0 Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making Feb 17, 2025 Decision Making Ethics
— Unverified 0 Evaluating the Paperclip Maximizer: Are RL-Based Language Models More Likely to Pursue Instrumental Goals? Feb 16, 2025 reinforcement-learning Reinforcement Learning
Code Available 0 Scalable Multi-Agent Offline Reinforcement Learning and the Role of Information Feb 16, 2025 Informativeness Reinforcement Learning (RL)
— Unverified 0 Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents Feb 15, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0 Tackling the Zero-Shot Reinforcement Learning Loss Directly Feb 15, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0 BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds Feb 14, 2025 Reinforcement Learning (RL)
— Unverified 0 Dynamic Reinforcement Learning for Actors Feb 14, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0 Reinforcement Learning in Strategy-Based and Atari Games: A Review of Google DeepMinds Innovations Feb 14, 2025 Atari Games Game of Go
— Unverified 0 Provably Efficient RL under Episode-Wise Safety in Constrained MDPs with Linear Function Approximation Feb 14, 2025 Reinforcement Learning (RL)
— Unverified 0 Causal Information Prioritization for Efficient Reinforcement Learning Feb 14, 2025 continuous-control Continuous Control
— Unverified 0 A Survey of Reinforcement Learning for Optimization in Automation Feb 13, 2025 Meta-Learning Navigate
— Unverified 0 Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches Feb 13, 2025 D4RL Offline RL
— Unverified 0 Safe Reinforcement Learning-based Control for Hydrogen Diesel Dual-Fuel Engines Feb 13, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0 Necessary and Sufficient Oracles: Toward a Computational Taxonomy For Reinforcement Learning Feb 12, 2025 regression Reinforcement Learning (RL)
— Unverified 0 A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective Feb 12, 2025 Feature Engineering feature selection
— Unverified 0 A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards Feb 12, 2025 Reinforcement Learning (RL)
— Unverified 0 COMBO-Grasp: Learning Constraint-Based Manipulation for Bimanual Occluded Grasping Feb 12, 2025 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0 Hierarchical Multi-Agent Framework for Carbon-Efficient Liquid-Cooled Data Center Clusters Feb 12, 2025 Cloud Computing Reinforcement Learning (RL)
— Unverified 0 Optimal Actuator Attacks on Autonomous Vehicles Using Reinforcement Learning Feb 11, 2025 Autonomous Vehicles reinforcement-learning
— Unverified 0 Exploratory Diffusion Model for Unsupervised Reinforcement Learning Feb 11, 2025 Efficient Exploration model
— Unverified 0 A Survey of In-Context Reinforcement Learning Feb 11, 2025 In-Context Reinforcement Learning reinforcement-learning
— Unverified 0 Towards a Formal Theory of the Need for Competence via Computational Intrinsic Motivation Feb 11, 2025 Reinforcement Learning (RL)
— Unverified 0 Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning Feb 11, 2025 Reinforcement Learning (RL)
— Unverified 0 Active Advantage-Aligned Online Reinforcement Learning with Offline Data Feb 11, 2025 Offline RL reinforcement-learning
Code Available 0 Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning Feb 11, 2025 Decision Making reinforcement-learning
— Unverified 0 Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol Feb 11, 2025 Model Selection Off-policy evaluation
— Unverified 0 A view on learning robust goal-conditioned value functions: Interplay between RL and MPC Feb 10, 2025 Model Predictive Control Reinforcement Learning (RL)
Code Available 0 Smell of Source: Learning-Based Odor Source Localization with Molecular Communication Feb 10, 2025 Computational Efficiency Disaster Response
— Unverified 0 Select before Act: Spatially Decoupled Action Repetition for Continuous Control Feb 10, 2025 continuous-control Continuous Control
— Unverified 0 Intelligent Offloading in Vehicular Edge Computing: A Comprehensive Review of Deep Reinforcement Learning Approaches and Architectures Feb 10, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0 Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models Feb 8, 2025 Conformal Prediction Decision Making
Code Available 0 Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning Feb 8, 2025 Combinatorial Optimization Computational Efficiency
— Unverified 0