Minimal Batch Adaptive Learning Policy Engine for Real-Time Mid-Price Forecasting in High-Frequency Trading Dec 26, 2024 Feature Importance Reinforcement Learning (RL)
— Unverified 0A Reinforcement Learning-Based Task Mapping Method to Improve the Reliability of Clustered Manycores Dec 26, 2024 Q-Learning Reinforcement Learning (RL)
— Unverified 0Preventive Energy Management for Distribution Systems Under Uncertain Events: A Deep Reinforcement Learning Approach Dec 26, 2024 Deep Reinforcement Learning energy management
— Unverified 0Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL Dec 25, 2024 Offline RL Reinforcement Learning (RL)
Code Code Available 0Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization Dec 24, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search Dec 24, 2024 Computational Efficiency Decision Making
— Unverified 0Multimodal Deep Reinforcement Learning for Portfolio Optimization Dec 23, 2024 Articles Benchmarking
— Unverified 0Optimizing Prompt Strategies for SAM: Advancing lesion Segmentation Across Diverse Medical Imaging Modalities Dec 23, 2024 Lesion Segmentation Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for Motor Control: A Comprehensive Review Dec 23, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning Dec 22, 2024 D4RL Q-Learning
— Unverified 0Environment Descriptions for Usability and Generalisation in Reinforcement Learning Dec 22, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps Dec 22, 2024 Reinforcement Learning (RL)
— Unverified 0Mathematics and Machine Creativity: A Survey on Bridging Mathematics with AI Dec 21, 2024 Reinforcement Learning (RL) Survey
— Unverified 0On Enhancing Network Throughput using Reinforcement Learning in Sliced Testbeds Dec 21, 2024 Combinatorial Optimization Reinforcement Learning (RL)
— Unverified 0Subgoal Discovery Using a Free Energy Paradigm and State Aggregations Dec 21, 2024 Reinforcement Learning (RL) Sequential Decision Making
— Unverified 0Optimizing Low-Speed Autonomous Driving: A Reinforcement Learning Approach to Route Stability and Maximum Speed Dec 20, 2024 Autonomous Driving reinforcement-learning
— Unverified 0Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning Dec 20, 2024 Hierarchical Reinforcement Learning reinforcement-learning
Code Code Available 0VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving Dec 20, 2024 Autonomous Driving Computational Efficiency
— Unverified 0From General to Specific: Tailoring Large Language Models for Personalized Healthcare Dec 20, 2024 Language Modeling Language Modelling
— Unverified 0Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues Dec 19, 2024 Hierarchical Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0AdaCred: Adaptive Causal Decision Transformers with Feature Crediting Dec 19, 2024 Attribute Imitation Learning
— Unverified 0Learning to Generate Research Idea with Dynamic Control Dec 19, 2024 Reinforcement Learning (RL) scientific discovery
— Unverified 0Offline Safe Reinforcement Learning Using Trajectory Classification Dec 19, 2024 Classification reinforcement-learning
Code Code Available 0Single-Loop Federated Actor-Critic across Heterogeneous Environments Dec 19, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Deep reinforcement learning with time-scale invariant memory Dec 19, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Bayesian Critique-Tune-Based Reinforcement Learning with Adaptive Pressure for Multi-Intersection Traffic Signal Control Dec 18, 2024 Bayesian Inference reinforcement-learning
— Unverified 0Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models Dec 18, 2024 HumanEval Imitation Learning
— Unverified 0Harvesting energy from turbulent winds with Reinforcement Learning Dec 18, 2024 Model Predictive Control reinforcement-learning
— Unverified 0Tilted Quantile Gradient Updates for Quantile-Constrained Reinforcement Learning Dec 17, 2024 Form reinforcement-learning
Code Code Available 0Multi-Task Reinforcement Learning for Quadrotors Dec 17, 2024 Autonomous Racing reinforcement-learning
— Unverified 0Learning Visuotactile Estimation and Control for Non-prehensile Manipulation under Occlusions Dec 17, 2024 Reinforcement Learning (RL)
— Unverified 0CLIP-RLDrive: Human-Aligned Autonomous Driving via CLIP-Based Reward Shaping in Reinforcement Learning Dec 17, 2024 Autonomous Driving Autonomous Vehicles
— Unverified 0ParMod: A Parallel and Modular Framework for Learning Non-Markovian Tasks Dec 17, 2024 NMT Reinforcement Learning (RL)
— Unverified 0Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency Dec 17, 2024 Computational Efficiency Reinforcement Learning (RL)
— Unverified 0Using machine learning to inform harvest control rule design in complex fishery settings Dec 16, 2024 Bayesian Optimization Management
Code Code Available 0Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation Dec 16, 2024 Deep Reinforcement Learning GPU
— Unverified 0Equivariant Action Sampling for Reinforcement Learning and Planning Dec 16, 2024 continuous-control Continuous Control
— Unverified 0Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents Dec 16, 2024 Autonomous Driving Language Modeling
— Unverified 0MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization Dec 16, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0MGDA: Model-based Goal Data Augmentation for Offline Goal-conditioned Weighted Supervised Learning Dec 16, 2024 Data Augmentation Reinforcement Learning (RL)
— Unverified 0Adaptive Reward Design for Reinforcement Learning Dec 14, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Automated Driving with Evolution Capability: A Reinforcement Learning Method with Monotonic Performance Enhancement Dec 14, 2024 Decision Making reinforcement-learning
— Unverified 0Continuous-time optimal investment with portfolio constraints: a reinforcement learning approach Dec 14, 2024 Reinforcement Learning (RL)
— Unverified 0Physics Instrument Design with Reinforcement Learning Dec 13, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Reward Machine Inference for Robotic Manipulation Dec 13, 2024 Reinforcement Learning (RL)
— Unverified 0Deep Reinforcement Learning for Scalable Multiagent Spacecraft Inspection Dec 13, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0PickLLM: Context-Aware RL-Assisted Large Language Model Routing Dec 12, 2024 Language Modeling Language Modelling
— Unverified 0Quantum-Train-Based Distributed Multi-Agent Reinforcement Learning Dec 12, 2024 Distributed Computing Multi-agent Reinforcement Learning
— Unverified 0From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning Dec 12, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0Reinforcement Learning Within the Classical Robotics Stack: A Case Study in Robot Soccer Dec 12, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0