Subtask-Aware Visual Reward Learning from Segmented Demonstrations Feb 28, 2025 Contrastive Learning Reinforcement Learning (RL)
— Unverified 0Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning Feb 27, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving Feb 27, 2025 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0Accelerating Model-Based Reinforcement Learning with State-Space World Models Feb 27, 2025 Model-based Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Feb 27, 2025 Domain Adaptation Machine Translation
— Unverified 0On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+(λ,λ))-GA Feb 27, 2025 Reinforcement Learning (RL)
Code Code Available 0AutoBS: Autonomous Base Station Deployment with Reinforcement Learning and Digital Network Twins Feb 27, 2025 Reinforcement Learning (RL)
Code Code Available 0Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning Feb 27, 2025 Deep Reinforcement Learning Management
— Unverified 0VEM: Environment-Free Exploration for Training GUI Agent with Value Environment Model Feb 26, 2025 Reinforcement Learning (RL)
Code Code Available 1Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning Feb 26, 2025 In-Context Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies Feb 26, 2025 Decision Making Management
Code Code Available 0Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones? Feb 26, 2025 GSM8K MMLU
— Unverified 0Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data Feb 26, 2025 Attribute reinforcement-learning
— Unverified 0Error-related Potential driven Reinforcement Learning for adaptive Brain-Computer Interfaces Feb 25, 2025 EEG Motor Imagery
— Unverified 0Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning Feb 25, 2025 Benchmarking Reinforcement Learning (RL)
Code Code Available 0FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real Feb 25, 2025 Object Reinforcement Learning (RL)
— Unverified 0SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Feb 25, 2025 Math Reinforcement Learning (RL)
— Unverified 0Yes, Q-learning Helps Offline In-Context RL Feb 24, 2025 In-Context Reinforcement Learning MuJoCo
— Unverified 0Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach Feb 24, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Humanoid Whole-Body Locomotion on Narrow Terrain via Dynamic Balance and Reinforcement Learning Feb 24, 2025 Reinforcement Learning (RL)
— Unverified 0From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs Feb 24, 2025 Language Modeling Language Modelling
Code Code Available 0Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation Feb 24, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Feb 24, 2025 GSM8K Math
Code Code Available 2TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control Feb 24, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 4Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies Feb 23, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control Feb 23, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Statistical Inference in Reinforcement Learning: A Selective Survey Feb 22, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 0Together We Rise: Optimizing Real-Time Multi-Robot Task Allocation using Coordinated Heterogeneous Plays Feb 22, 2025 Collision Avoidance Management
— Unverified 0An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning Feb 22, 2025 ARC Continual Learning
— Unverified 0Hyperspherical Normalization for Scalable Deep Reinforcement Learning Feb 21, 2025 continuous-control Continuous Control
— Unverified 0On the Design of Safe Continual RL Methods for Control of Nonlinear Systems Feb 21, 2025 Continual Learning MuJoCo
Code Code Available 0The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning Feb 21, 2025 Decision Making reinforcement-learning
— Unverified 0Generating π-Functional Molecules Using STGG+ with Active Learning Feb 20, 2025 Active Learning reinforcement-learning
Code Code Available 1Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models Feb 20, 2025 Reinforcement Learning (RL) Zero-shot Generalization
— Unverified 0Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Feb 20, 2025 Math reinforcement-learning
Code Code Available 7Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications Feb 20, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse Feb 20, 2025 Benchmarking Graph Attention
— Unverified 0Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning Feb 20, 2025 Reinforcement Learning (RL)
— Unverified 0MLGym: A New Framework and Benchmark for Advancing AI Research Agents Feb 20, 2025 Reinforcement Learning (RL)
— Unverified 0Optimizing Gene-Based Testing for Antibiotic Resistance Prediction Feb 19, 2025 Diagnostic Prediction
— Unverified 0Comprehensive Review on the Control of Heat Pumps for Energy Flexibility in Distribution Networks Feb 19, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Uncertainty quantification for Markov chains with application to temporal difference learning Feb 19, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Hierarchical RL-MPC for Demand Response Scheduling Feb 19, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin Feb 19, 2025 GPU Logical Reasoning
— Unverified 0EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning Feb 18, 2025 Navigate Reinforcement Learning (RL)
— Unverified 0Demystifying Multilingual Chain-of-Thought in Process Reward Modeling Feb 18, 2025 Reinforcement Learning (RL)
— Unverified 0LocalEscaper: A Weakly-supervised Framework with Regional Reconstruction for Scalable Neural TSP Solvers Feb 18, 2025 Reinforcement Learning (RL) Traveling Salesman Problem
— Unverified 0Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex Tasks Feb 18, 2025 Imitation Learning Minecraft
Code Code Available 0Navigating Demand Uncertainty in Container Shipping: Deep Reinforcement Learning for Enabling Adaptive and Feasible Master Stowage Planning Feb 18, 2025 Combinatorial Optimization Deep Reinforcement Learning
Code Code Available 0RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Feb 18, 2025 3DGS Autonomous Driving
— Unverified 0