Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning Jan 3, 2025 Multi-Goal Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Noise-Resilient Symbolic Regression with Dynamic Gating Reinforcement Learning Jan 2, 2025 regression reinforcement-learning
Code Code Available 0Neural Motion Simulator Pushing the Limit of World Models in Reinforcement Learning Jan 1, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0RaSS: Improving Denoising Diffusion Samplers with Reinforced Active Sampling Scheduler Jan 1, 2025 Denoising Reinforcement Learning (RL)
— Unverified 0A Graphical Approach to State Variable Selection in Off-policy Learning Jan 1, 2025 Causal Inference Dimensionality Reduction
— Unverified 0Hybridising Reinforcement Learning and Heuristics for Hierarchical Directed Arc Routing Problems Jan 1, 2025 ARC reinforcement-learning
Code Code Available 0FORM: Learning Expressive and Transferable First-Order Logic Reward Machines Dec 31, 2024 Form Reinforcement Learning (RL)
— Unverified 0Towards Unraveling and Improving Generalization in World Models Dec 31, 2024 Reinforcement Learning (RL)
— Unverified 0An Unsupervised Anomaly Detection in Electricity Consumption Using Reinforcement Learning and Time Series Forest Based Framework Dec 30, 2024 Anomaly Detection Model Selection
— Unverified 0Isoperimetry is All We Need: Langevin Posterior Sampling for RL with Sublinear Regret Dec 30, 2024 All Reinforcement Learning (RL)
— Unverified 0Weber-Fechner Law in Temporal Difference learning derived from Control as Inference Dec 30, 2024 Reinforcement Learning (RL)
— Unverified 0UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI Dec 30, 2024 Benchmarking Reinforcement Learning (RL)
— Unverified 0Dynamic Optimization of Storage Systems Using Reinforcement Learning Techniques Dec 29, 2024 CPU Q-Learning
— Unverified 0Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey Dec 29, 2024 Code Generation Compiler Optimization
— Unverified 0Exploiting Hybrid Policy in Reinforcement Learning for Interpretable Temporal Logic Manipulation Dec 29, 2024 Reinforcement Learning (RL)
Code Code Available 1Goal-Conditioned Data Augmentation for Offline Reinforcement Learning Dec 29, 2024 D4RL Data Augmentation
— Unverified 0Efficient and Scalable Deep Reinforcement Learning for Mean Field Control Games Dec 28, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Election of Collaborators via Reinforcement Learning for Federated Brain Tumor Segmentation Dec 28, 2024 Brain Tumor Segmentation Federated Learning
— Unverified 0Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization Dec 27, 2024 Causal Discovery Graph Attention
— Unverified 0Preventive Energy Management for Distribution Systems Under Uncertain Events: A Deep Reinforcement Learning Approach Dec 26, 2024 Deep Reinforcement Learning energy management
— Unverified 0Minimal Batch Adaptive Learning Policy Engine for Real-Time Mid-Price Forecasting in High-Frequency Trading Dec 26, 2024 Feature Importance Reinforcement Learning (RL)
— Unverified 0Provably Efficient Exploration in Reward Machines with Low Regret Dec 26, 2024 Efficient Exploration Reinforcement Learning (RL)
— Unverified 0xSRL: Safety-Aware Explainable Reinforcement Learning -- Safety as a Product of Explainability Dec 26, 2024 Autonomous Vehicles Reinforcement Learning (RL)
Code Code Available 0Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning Dec 26, 2024 Decision Making Deep Reinforcement Learning
— Unverified 0A Reinforcement Learning-Based Task Mapping Method to Improve the Reliability of Clustered Manycores Dec 26, 2024 Q-Learning Reinforcement Learning (RL)
— Unverified 0Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL Dec 25, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 0HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Dec 25, 2024 Reinforcement Learning (RL)
Code Code Available 5Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization Dec 24, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search Dec 24, 2024 Computational Efficiency Decision Making
— Unverified 0Reinforcement Learning for Motor Control: A Comprehensive Review Dec 23, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Optimizing Prompt Strategies for SAM: Advancing lesion Segmentation Across Diverse Medical Imaging Modalities Dec 23, 2024 Lesion Segmentation Reinforcement Learning (RL)
— Unverified 0Multimodal Deep Reinforcement Learning for Portfolio Optimization Dec 23, 2024 Articles Benchmarking
— Unverified 0Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps Dec 22, 2024 Reinforcement Learning (RL)
— Unverified 0ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning Dec 22, 2024 D4RL Q-Learning
— Unverified 0Environment Descriptions for Usability and Generalisation in Reinforcement Learning Dec 22, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Subgoal Discovery Using a Free Energy Paradigm and State Aggregations Dec 21, 2024 Reinforcement Learning (RL) Sequential Decision Making
— Unverified 0Mathematics and Machine Creativity: A Survey on Bridging Mathematics with AI Dec 21, 2024 Reinforcement Learning (RL) Survey
— Unverified 0On Enhancing Network Throughput using Reinforcement Learning in Sliced Testbeds Dec 21, 2024 Combinatorial Optimization Reinforcement Learning (RL)
— Unverified 0Optimizing Low-Speed Autonomous Driving: A Reinforcement Learning Approach to Route Stability and Maximum Speed Dec 20, 2024 Autonomous Driving reinforcement-learning
— Unverified 0Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning Dec 20, 2024 Hierarchical Reinforcement Learning reinforcement-learning
Code Code Available 0From General to Specific: Tailoring Large Language Models for Personalized Healthcare Dec 20, 2024 Language Modeling Language Modelling
— Unverified 0VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving Dec 20, 2024 Autonomous Driving Computational Efficiency
— Unverified 0Offline Reinforcement Learning for LLM Multi-Step Reasoning Dec 20, 2024 GSM8K Math
Code Code Available 2AdaCred: Adaptive Causal Decision Transformers with Feature Crediting Dec 19, 2024 Attribute Imitation Learning
— Unverified 0Single-Loop Federated Actor-Critic across Heterogeneous Environments Dec 19, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Offline Safe Reinforcement Learning Using Trajectory Classification Dec 19, 2024 Classification reinforcement-learning
Code Code Available 0Deep reinforcement learning with time-scale invariant memory Dec 19, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues Dec 19, 2024 Hierarchical Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Learning to Generate Research Idea with Dynamic Control Dec 19, 2024 Reinforcement Learning (RL) scientific discovery
— Unverified 0Bayesian Critique-Tune-Based Reinforcement Learning with Adaptive Pressure for Multi-Intersection Traffic Signal Control Dec 18, 2024 Bayesian Inference reinforcement-learning
— Unverified 0