MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator Dec 7, 2023 Offline RL reinforcement-learning
Code Code Available 0Efficient Parallel Reinforcement Learning Framework using the Reactor Model Dec 7, 2023 OpenAI Gym Q-Learning
Code Code Available 0Safety-Enhanced Self-Learning for Optimal Power Converter Control Dec 7, 2023 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer Dec 6, 2023 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0Language Model Alignment with Elastic Reset Dec 6, 2023 Chatbot Language Modeling
Code Code Available 0Evaluation of Active Feature Acquisition Methods for Static Feature Settings Dec 6, 2023 Offline RL reinforcement-learning
— Unverified 0Diffused Task-Agnostic Milestone Planner Dec 6, 2023 Decision Making Offline RL
— Unverified 0Demand response for residential building heating: Effective Monte Carlo Tree Search control based on physics-informed neural networks Dec 6, 2023 Board Games Model Predictive Control
— Unverified 0Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications Dec 5, 2023 Reinforcement Learning (RL)
— Unverified 0LExCI: A Framework for Reinforcement Learning with Embedded Systems Dec 5, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Contact Energy Based Hindsight Experience Prioritization Dec 5, 2023 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World Dec 5, 2023 Benchmarking Diversity
— Unverified 0MASP: Scalable GNN-based Planning for Multi-Agent Navigation Dec 5, 2023 Reinforcement Learning (RL) Zero-shot Generalization
— Unverified 0RL-Based Cargo-UAV Trajectory Planning and Cell Association for Minimum Handoffs, Disconnectivity, and Energy Consumption Dec 5, 2023 Reinforcement Learning (RL) Trajectory Planning
— Unverified 0Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems Dec 5, 2023 Form Model-based Reinforcement Learning
— Unverified 0Training Reinforcement Learning Agents and Humans With Difficulty-Conditioned Generators Dec 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Integrated Drill Boom Hole-Seeking Control via Reinforcement Learning Dec 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Deep Reinforcement Learning for Community Battery Scheduling under Uncertainties of Load, PV Generation, and Energy Prices Dec 4, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Adaptive operator selection utilising generalised experience Dec 4, 2023 Reinforcement Learning (RL)
— Unverified 0Foundations for Transfer in Reinforcement Learning: A Taxonomy of Knowledge Modalities Dec 4, 2023 Computational Efficiency reinforcement-learning
— Unverified 0Learning Curricula in Open-Ended Worlds Dec 3, 2023 Decision Making Deep Reinforcement Learning
— Unverified 0BenchMARL: Benchmarking Multi-Agent Reinforcement Learning Dec 3, 2023 Benchmarking Multi-agent Reinforcement Learning
— Unverified 0Self-Critical Alternate Learning based Semantic Broadcast Communication Dec 3, 2023 Reinforcement Learning (RL) Semantic Communication
— Unverified 0A Survey of Temporal Credit Assignment in Deep Reinforcement Learning Dec 2, 2023 Decision Making Deep Reinforcement Learning
— Unverified 0DDxT: Deep Generative Transformer Models for Differential Diagnosis Dec 2, 2023 Reinforcement Learning (RL) Self-Supervised Learning
Code Code Available 0A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning Dec 2, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk Dec 1, 2023 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version) Dec 1, 2023 Keypoint Detection Reinforcement Learning (RL)
Code Code Available 0Safe Reinforcement Learning in Tensor Reproducing Kernel Hilbert Space Dec 1, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Optimal Attack and Defense for Reinforcement Learning Nov 30, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization Nov 30, 2023 Policy Gradient Methods reinforcement-learning
Code Code Available 0Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control Nov 30, 2023 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning Nov 29, 2023 Deep Reinforcement Learning Long Form Question Answering
Code Code Available 0Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning Nov 29, 2023 Astronomy Offline RL
— Unverified 0Two-Step Reinforcement Learning for Multistage Strategy Card Game Nov 29, 2023 Card Games Decision Making
— Unverified 0Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks Nov 29, 2023 Q-Learning reinforcement-learning
— Unverified 0Safe Reinforcement Learning in a Simulated Robotic Arm Nov 28, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Two-step dynamic obstacle avoidance Nov 28, 2023 Navigate Reinforcement Learning (RL)
Code Code Available 0An Investigation of Time Reversal Symmetry in Reinforcement Learning Nov 28, 2023 Data Augmentation Friction
Code Code Available 0A Graph Neural Network-Based QUBO-Formulated Hamiltonian-Inspired Loss Function for Combinatorial Optimization using Reinforcement Learning Nov 27, 2023 Combinatorial Optimization Graph Neural Network
— Unverified 0A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning Nov 27, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0Temporal Transfer Learning for Traffic Optimization with Coarse-grained Advisory Autonomy Nov 27, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Replay across Experiments: A Natural Extension of Off-Policy RL Nov 27, 2023 Reinforcement Learning (RL)
— Unverified 0Optimal Observer Design Using Reinforcement Learning and Quadratic Neural Networks Nov 27, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation Nov 26, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning Nov 26, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning Nov 25, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Margin Trader: A Reinforcement Learning Framework for Portfolio Management with Margin and Constraints Nov 25, 2023 Deep Reinforcement Learning Management
Code Code Available 0Digital Twin-Native AI-Driven Service Architecture for Industrial Networks Nov 24, 2023 Reinforcement Learning (RL)
— Unverified 0Evaluating Pretrained models for Deployable Lifelong Learning Nov 22, 2023 Atari Games class-incremental learning
— Unverified 0