Constructing Ancestral Recombination Graphs through Reinforcement Learning | Jun 17, 2024 | reinforcement-learning, Reinforcement Learning | Unverified 0
Adding Conditional Control to Diffusion Models with Reinforcement Learning | Jun 17, 2024 | reinforcement-learning, Reinforcement Learning | Unverified 0
Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions | Jun 17, 2024 | regression, Reinforcement Learning (RL) | Unverified 0
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Jun 17, 2024 | Autonomous Driving, Decision Making | Unverified 0
Design of Interacting Particle Systems for Fast Linear Quadratic RL | Jun 16, 2024 | Reinforcement Learning (RL) | Unverified 0
UniZero: Generalized and Efficient Planning with Scalable Latent World Models | Jun 15, 2024 | Multi-Task Learning, Reinforcement Learning (RL) | Unverified 0
Generating and Evolving Reward Functions for Highway Driving with Large Language Models | Jun 15, 2024 | Autonomous Driving, Reinforcement Learning (RL) | Unverified 0
ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR | Jun 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models | Jun 14, 2024 | Code Generation, Reinforcement Learning (RL) | Unverified 0
Finite-Time Analysis of Simultaneous Double Q-learning | Jun 14, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified 0
CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving | Jun 13, 2024 | Autonomous Driving, Autonomous Vehicles | Unverified 0
e-COP : Episodic Constrained Optimization of Policies | Jun 13, 2024 | LEMMA, reinforcement-learning | Unverified 0
Adaptive Actor-Critic Based Optimal Regulation for Drift-Free Uncertain Nonlinear Systems | Jun 13, 2024 | Reinforcement Learning (RL) | Unverified 0
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RL, Offline RL | Unverified 0
SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RL, Offline RL | Unverified 0
Is Value Learning Really the Main Bottleneck in Offline RL? | Jun 13, 2024 | Imitation Learning, Offline RL | Code Available 3
Data-driven modeling and supervisory control system optimization for plug-in hybrid electric vehicles | Jun 13, 2024 | energy management, Management | Unverified 0
Reinforcement Learning to Disentangle Multiqubit Quantum States from Partial Observations | Jun 12, 2024 | Benchmarking, Deep Reinforcement Learning | Code Available 0
Toward Enhanced Reinforcement Learning-Based Resource Management via Digital Twin: Opportunities, Applications, and Challenges | Jun 12, 2024 | Management, Reinforcement Learning (RL) | Unverified 0
Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning | Jun 12, 2024 | D4RL, MuJoCo | Code Available 0
When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions | Jun 12, 2024 | Reinforcement Learning (RL) | Code Available 0
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning | Jun 12, 2024 | Reinforcement Learning (RL) | Unverified 0
RILe: Reinforced Imitation Learning | Jun 12, 2024 | Computational Efficiency, Imitation Learning | Unverified 0
Unifying Interpretability and Explainability for Alzheimer's Disease Progression Prediction | Jun 11, 2024 | Reinforcement Learning (RL) | Code Available 0
Hybrid Reinforcement Learning from Offline Observation Alone | Jun 11, 2024 | reinforcement-learning, Reinforcement Learning | Unverified 0
Enhanced Gene Selection in Single-Cell Genomics: Pre-Filtering Synergy and Reinforced Optimization | Jun 11, 2024 | Reinforcement Learning (RL) | Unverified 0
Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment | Jun 11, 2024 | MuJoCo, reinforcement-learning | Unverified 0
Integrating Domain Knowledge for handling Limited Data in Offline RL | Jun 11, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified 0
Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning | Jun 11, 2024 | Multi-Armed Bandits, Reinforcement Learning (RL) | Unverified 0
CHARME: A chain-based reinforcement learning approach for the minor embedding problem | Jun 11, 2024 | Combinatorial Optimization, Graph Neural Network | Unverified 0
Decoupling regularization from the action space | Jun 10, 2024 | Reinforcement Learning (RL) | Code Available 0
Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning | Jun 10, 2024 | Atari Games, Reinforcement Learning (RL) | Code Available 1
Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? | Jun 10, 2024 | Deep Reinforcement Learning, Offline RL | Code Available 0
Deep Multi-Objective Reinforcement Learning for Utility-Based Infrastructural Maintenance Optimization | Jun 10, 2024 | Multi-Objective Reinforcement Learning, reinforcement-learning | Code Available 0
Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning | Jun 10, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified 0
EXPIL: Explanatory Predicate Invention for Learning in Games | Jun 10, 2024 | Reinforcement Learning (RL) | Code Available 0
STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models | Jun 9, 2024 | Reinforcement Learning (RL), text-based games | Code Available 0
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data | Jun 9, 2024 | Benchmarking, Management | Code Available 1
Enhanced Flight Envelope Protection: A Novel Reinforcement Learning Approach | Jun 8, 2024 | reinforcement-learning, Reinforcement Learning | Unverified 0
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL | Jun 8, 2024 | Data Augmentation, Mamba | Code Available 0
Diffusion-based Reinforcement Learning for Dynamic UAV-assisted Vehicle Twins Migration in Vehicular Metaverses | Jun 8, 2024 | Heuristic Search, Reinforcement Learning (RL) | Unverified 0
Optimizing Automatic Differentiation with Deep Reinforcement Learning | Jun 7, 2024 | Computational Efficiency, Deep Reinforcement Learning | Unverified 0
Sim-to-Real Transfer of Deep Reinforcement Learning Agents for Online Coverage Path Planning | Jun 7, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified 0
Primitive Agentic First-Order Optimization | Jun 7, 2024 | Reinforcement Learning (RL) | Unverified 0
Proofread: Fixes All Errors with One Tap | Jun 6, 2024 | All, Quantization | Unverified 0
Strategically Conservative Q-Learning | Jun 6, 2024 | D4RL, Offline RL | Code Available 1
Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking | Jun 6, 2024 | reinforcement-learning, Reinforcement Learning | Unverified 0
Breeding Programs Optimization with Reinforcement Learning | Jun 6, 2024 | reinforcement-learning, Reinforcement Learning | Unverified 0
Optimizing Autonomous Driving for Safety: A Human-Centric Approach with LLM-Enhanced RLHF | Jun 6, 2024 | Autonomous Driving, reinforcement-learning | Unverified 0
Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning | Jun 6, 2024 | reinforcement-learning, Reinforcement Learning | Code Available 0