The Effective Horizon Explains Deep RL Performance in Stochastic Environments Dec 13, 2023 Reinforcement Learning (RL)
Code Code Available 1Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training Time Safety Violations Dec 13, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0An Invitation to Deep Reinforcement Learning Dec 13, 2023 Code Generation Deep Reinforcement Learning
— Unverified 0Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation Dec 12, 2023 Decision Making Language Modelling
— Unverified 0Toward a Reinforcement-Learning-Based System for Adjusting Medication to Minimize Speech Disfluency Dec 12, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A dynamical clipping approach with task feedback for Proximal Policy Optimization Dec 12, 2023 Language Modelling Large Language Model
Code Code Available 0Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach Dec 12, 2023 Knowledge Distillation Offline RL
Code Code Available 1Sequential Planning in Large Partially Observable Environments guided by LLMs Dec 12, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 1Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning Dec 12, 2023 Distributional Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Beyond Expected Return: Accounting for Policy Reproducibility when Evaluating Reinforcement Learning Algorithms Dec 12, 2023 Bayesian Optimisation Reinforcement Learning (RL)
— Unverified 0Learning Polynomial Representations of Physical Objects with Application to Certifying Correct Packing Configurations Dec 11, 2023 Object One-Class Classification
— Unverified 0Partial End-to-end Reinforcement Learning for Robustness Against Modelling Error in Autonomous Racing Dec 11, 2023 Autonomous Racing Imitation Learning
— Unverified 0Reward Certification for Policy Smoothed Reinforcement Learning Dec 11, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Spreeze: High-Throughput Parallel Reinforcement Learning Framework Dec 11, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0KnowGPT: Knowledge Graph based Prompting for Large Language Models Dec 11, 2023 Knowledge Graphs Prompt Engineering
— Unverified 0Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization Dec 10, 2023 Q-Learning Reinforcement Learning (RL)
Code Code Available 0Modifying RL Policies with Imagined Actions: How Predictable Policies Can Enable Users to Perform Novel Tasks Dec 10, 2023 Reinforcement Learning (RL)
— Unverified 0The Generalization Gap in Offline Reinforcement Learning Dec 10, 2023 Offline RL reinforcement-learning
Code Code Available 1Evolving Reservoirs for Meta Reinforcement Learning Dec 9, 2023 Meta Reinforcement Learning reinforcement-learning
Code Code Available 2On the calibration of compartmental epidemiological models Dec 9, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 0PerfRL: A Small Language Model Framework for Efficient Code Optimization Dec 9, 2023 Language Modeling Language Modelling
— Unverified 0Guaranteed Trust Region Optimization via Two-Phase KL Penalization Dec 8, 2023 Computational Efficiency Reinforcement Learning (RL)
— Unverified 0Multi-Agent Reinforcement Learning via Distributed MPC as a Function Approximator Dec 8, 2023 Model Predictive Control Multi-agent Reinforcement Learning
Code Code Available 1Exploring Parity Challenges in Reinforcement Learning through Curriculum Learning with Noisy Labels Dec 8, 2023 Learning with noisy labels Reinforcement Learning (RL)
Code Code Available 0UniTSA: A Universal Reinforcement Learning Framework for V2X Traffic Signal Control Dec 8, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 1Modeling Risk in Reinforcement Learning: A Literature Mapping Dec 8, 2023 Management reinforcement-learning
— Unverified 0Reinforcement Learning-Based Bionic Reflex Control for Anthropomorphic Robotic Grasping exploiting Domain Randomization Dec 8, 2023 Reinforcement Learning (RL) Robotic Grasping
— Unverified 0Efficient Parallel Reinforcement Learning Framework using the Reactor Model Dec 7, 2023 OpenAI Gym Q-Learning
Code Code Available 0Learning to sample in Cartesian MRI Dec 7, 2023 compressed sensing Computational Efficiency
— Unverified 0MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator Dec 7, 2023 Offline RL reinforcement-learning
Code Code Available 0Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning Dec 7, 2023 All Reinforcement Learning (RL)
Code Code Available 0CODEX: A Cluster-Based Method for Explainable Reinforcement Learning Dec 7, 2023 Clustering counterfactual
Code Code Available 0Safety-Enhanced Self-Learning for Optimal Power Converter Control Dec 7, 2023 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Language Model Alignment with Elastic Reset Dec 6, 2023 Chatbot Language Modeling
Code Code Available 0Pearl: A Production-ready Reinforcement Learning Agent Dec 6, 2023 Benchmarking reinforcement-learning
Code Code Available 4Demand response for residential building heating: Effective Monte Carlo Tree Search control based on physics-informed neural networks Dec 6, 2023 Board Games Model Predictive Control
— Unverified 0On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer Dec 6, 2023 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0Diffused Task-Agnostic Milestone Planner Dec 6, 2023 Decision Making Offline RL
— Unverified 0Evaluation of Active Feature Acquisition Methods for Static Feature Settings Dec 6, 2023 Offline RL reinforcement-learning
— Unverified 0Mitigating Open-Vocabulary Caption Hallucinations Dec 6, 2023 Diversity Hallucination
Code Code Available 1RL-Based Cargo-UAV Trajectory Planning and Cell Association for Minimum Handoffs, Disconnectivity, and Energy Consumption Dec 5, 2023 Reinforcement Learning (RL) Trajectory Planning
— Unverified 0Convergence Rates for Stochastic Approximation: Biased Noise with Unbounded Variance, and Applications Dec 5, 2023 Reinforcement Learning (RL)
— Unverified 0Contact Energy Based Hindsight Experience Prioritization Dec 5, 2023 Reinforcement Learning (RL) Robot Manipulation
— Unverified 0MASP: Scalable GNN-based Planning for Multi-Agent Navigation Dec 5, 2023 Reinforcement Learning (RL) Zero-shot Generalization
— Unverified 0LExCI: A Framework for Reinforcement Learning with Embedded Systems Dec 5, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems Dec 5, 2023 Form Model-based Reinforcement Learning
— Unverified 0SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World Dec 5, 2023 Benchmarking Diversity
— Unverified 0Adaptive operator selection utilising generalised experience Dec 4, 2023 Reinforcement Learning (RL)
— Unverified 0Deep Reinforcement Learning for Community Battery Scheduling under Uncertainties of Load, PV Generation, and Energy Prices Dec 4, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Training Reinforcement Learning Agents and Humans With Difficulty-Conditioned Generators Dec 4, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0