DreamerV3 for Traffic Signal Control: Hyperparameter Tuning and Performance Mar 4, 2025 Reinforcement Learning (RL) Traffic Signal Control
— Unverified 0All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning Mar 3, 2025 All Reinforcement Learning (RL)
— Unverified 0Accelerating Multi-Task Temporal Difference Learning under Low-Rank Representation Mar 3, 2025 Reinforcement Learning (RL)
— Unverified 0What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret Mar 3, 2025 Math Reinforcement Learning (RL)
— Unverified 0Active Alignments of Lens Systems with Reinforcement Learning Mar 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning Mar 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Quality-Driven Curation of Remote Sensing Vision-Language Data via Learned Scoring Models Mar 2, 2025 Reinforcement Learning (RL)
— Unverified 0Minimax Optimal Reinforcement Learning with Quasi-Optimism Mar 2, 2025 Computational Efficiency reinforcement-learning
— Unverified 0Scalable Reinforcement Learning for Virtual Machine Scheduling Mar 1, 2025 Cloud Computing reinforcement-learning
— Unverified 0Towards Understanding the Benefit of Multitask Representation Learning in Decision Process Mar 1, 2025 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Never too Prim to Swim: An LLM-Enhanced RL-based Adaptive S-Surface Controller for AUVs under Extreme Sea Conditions Mar 1, 2025 Language Modeling Language Modelling
— Unverified 0Adaptive Reinforcement Learning for State Avoidance in Discrete Event Systems Feb 28, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning Feb 28, 2025 Model-based Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Subtask-Aware Visual Reward Learning from Segmented Demonstrations Feb 28, 2025 Contrastive Learning Reinforcement Learning (RL)
— Unverified 0Hierarchical and Modular Network on Non-prehensile Manipulation in General Environments Feb 28, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning Feb 27, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning Feb 27, 2025 Deep Reinforcement Learning Management
— Unverified 0AutoBS: Autonomous Base Station Deployment with Reinforcement Learning and Digital Network Twins Feb 27, 2025 Reinforcement Learning (RL)
Code Code Available 0Accelerating Model-Based Reinforcement Learning with State-Space World Models Feb 27, 2025 Model-based Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0On the Importance of Reward Design in Reinforcement Learning-based Dynamic Algorithm Configuration: A Case Study on OneMax with (1+(λ,λ))-GA Feb 27, 2025 Reinforcement Learning (RL)
Code Code Available 0R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Feb 27, 2025 Domain Adaptation Machine Translation
— Unverified 0CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving Feb 27, 2025 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data Feb 26, 2025 Attribute reinforcement-learning
— Unverified 0WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies Feb 26, 2025 Decision Making Management
Code Code Available 0Distill Not Only Data but Also Rewards: Can Smaller Language Models Surpass Larger Ones? Feb 26, 2025 GSM8K MMLU
— Unverified 0Error-related Potential driven Reinforcement Learning for adaptive Brain-Computer Interfaces Feb 25, 2025 EEG Motor Imagery
— Unverified 0FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real Feb 25, 2025 Object Reinforcement Learning (RL)
— Unverified 0SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Feb 25, 2025 Math Reinforcement Learning (RL)
— Unverified 0Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement Learning Feb 25, 2025 Benchmarking Reinforcement Learning (RL)
Code Code Available 0From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs Feb 24, 2025 Language Modeling Language Modelling
Code Code Available 0Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation Feb 24, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0Humanoid Whole-Body Locomotion on Narrow Terrain via Dynamic Balance and Reinforcement Learning Feb 24, 2025 Reinforcement Learning (RL)
— Unverified 0Survey on Strategic Mining in Blockchain: A Reinforcement Learning Approach Feb 24, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Yes, Q-learning Helps Offline In-Context RL Feb 24, 2025 In-Context Reinforcement Learning MuJoCo
— Unverified 0Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies Feb 23, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Toward Dependency Dynamics in Multi-Agent Reinforcement Learning for Traffic Signal Control Feb 23, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Together We Rise: Optimizing Real-Time Multi-Robot Task Allocation using Coordinated Heterogeneous Plays Feb 22, 2025 Collision Avoidance Management
— Unverified 0Statistical Inference in Reinforcement Learning: A Selective Survey Feb 22, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 0An Autonomous Network Orchestration Framework Integrating Large Language Models with Continual Reinforcement Learning Feb 22, 2025 ARC Continual Learning
— Unverified 0The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning Feb 21, 2025 Decision Making reinforcement-learning
— Unverified 0On the Design of Safe Continual RL Methods for Control of Nonlinear Systems Feb 21, 2025 Continual Learning MuJoCo
Code Code Available 0Hyperspherical Normalization for Scalable Deep Reinforcement Learning Feb 21, 2025 continuous-control Continuous Control
— Unverified 0Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse Feb 20, 2025 Benchmarking Graph Attention
— Unverified 0MLGym: A New Framework and Benchmark for Advancing AI Research Agents Feb 20, 2025 Reinforcement Learning (RL)
— Unverified 0Learning from Reward-Free Offline Data: A Case for Planning with Latent Dynamics Models Feb 20, 2025 Reinforcement Learning (RL) Zero-shot Generalization
— Unverified 0Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning Feb 20, 2025 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications Feb 20, 2025 Decision Making Deep Reinforcement Learning
— Unverified 0Comprehensive Review on the Control of Heat Pumps for Energy Flexibility in Distribution Networks Feb 19, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Optimizing Gene-Based Testing for Antibiotic Resistance Prediction Feb 19, 2025 Diagnostic Prediction
— Unverified 0Uncertainty quantification for Markov chains with application to temporal difference learning Feb 19, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0