Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents Oct 31, 2023 Deep Reinforcement Learning Ensemble Learning
— Unverified 0Closed Drafting as a Case Study for First-Principle Interpretability, Memory, and Generalizability in Deep Reinforcement Learning Oct 31, 2023 Decision Making Deep Reinforcement Learning
— Unverified 0Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving Oct 31, 2023 Autonomous Driving Autonomous Vehicles
— Unverified 0A Tractable Inference Perspective of Offline RL Oct 31, 2023 MuJoCo Offline RL
— Unverified 0Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement Oct 30, 2023 Reinforcement Learning (RL)
— Unverified 0On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics Oct 30, 2023 Reinforcement Learning (RL)
— Unverified 0Posterior Sampling for Competitive RL: Function Approximation and Partial Observation Oct 30, 2023 Reinforcement Learning (RL)
— Unverified 0Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation Oct 29, 2023 Computational Efficiency Reinforcement Learning (RL)
— Unverified 0Spacecraft Autonomous Decision-Planning for Collision Avoidance: a Reinforcement Learning Approach Oct 29, 2023 Collision Avoidance Decision Making
— Unverified 0MAG-GNN: Reinforcement Learning Boosted Graph Neural Network Oct 29, 2023 Combinatorial Optimization Graph Learning
— Unverified 0Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households Oct 29, 2023 energy management Reinforcement Learning (RL)
— Unverified 0Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning Oct 29, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game Oct 29, 2023 Decision Making Reinforcement Learning (RL)
— Unverified 0Behavior Alignment via Reward Function Optimization Oct 29, 2023 Reinforcement Learning (RL)
— Unverified 0Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness Oct 28, 2023 Benchmarking image-classification
Code Code Available 0Unsupervised Behavior Extraction via Random Intent Priors Oct 28, 2023 Reinforcement Learning (RL)
— Unverified 0Robust Offline Reinforcement learning with Heavy-Tailed Rewards Oct 28, 2023 Offline RL Off-policy evaluation
Code Code Available 0Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning Oct 27, 2023 Autonomous Driving D4RL
— Unverified 0Deep Reinforcement Learning for Weapons to Targets Assignment in a Hypersonic strike Oct 27, 2023 Decision Making Deep Reinforcement Learning
— Unverified 0Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage Oct 27, 2023 Offline RL Reinforcement Learning (RL)
Code Code Available 0Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion Oct 26, 2023 Deep Reinforcement Learning Efficient Exploration
— Unverified 0Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation Oct 26, 2023 counterfactual Off-policy evaluation
Code Code Available 0Demonstration-Regularized RL Oct 26, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0CQM: Curriculum Reinforcement Learning with a Quantized World Model Oct 26, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates Oct 26, 2023 Data Augmentation reinforcement-learning
Code Code Available 0Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation Oct 25, 2023 Contrastive Learning model
— Unverified 0Transfer of Reinforcement Learning-Based Controllers from Model- to Hardware-in-the-Loop Oct 25, 2023 Reinforcement Learning (RL) Transfer Learning
— Unverified 0Privately Aligning Language Models with Reinforcement Learning Oct 25, 2023 Instruction Following Privacy Preserving
— Unverified 0MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning Oct 25, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Controlled Decoding from Language Models Oct 25, 2023 Language Modeling Language Modelling
— Unverified 0Hyperparameter Optimization for Multi-Objective Reinforcement Learning Oct 25, 2023 Hyperparameter Optimization Multi-Objective Reinforcement Learning
— Unverified 0A Contextualized Real-Time Multimodal Emotion Recognition for Conversational Agents using Graph Convolutional Networks in Reinforcement Learning Oct 24, 2023 Emotion Classification Emotion Recognition
— Unverified 0Finetuning Offline World Models in the Real World Oct 24, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0Fractal Landscapes in Policy Optimization Oct 24, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0WebWISE: Web Interface Control and Sequential Exploration with Large Language Models Oct 24, 2023 Imitation Learning In-Context Learning
— Unverified 0Reinforcement learning in large, structured action spaces: A simulation study of decision support for spinal cord injury rehabilitation Oct 23, 2023 Decision Making Reinforcement Learning (RL)
— Unverified 0Enhancing Robotic Manipulation: Harnessing the Power of Multi-Task Reinforcement Learning and Single Life Reinforcement Learning in Meta-World Oct 23, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Corruption-Robust Offline Reinforcement Learning with General Function Approximation Oct 23, 2023 Offline RL reinforcement-learning
Code Code Available 0Diverse Priors for Deep Reinforcement Learning Oct 23, 2023 Deep Reinforcement Learning Diversity
— Unverified 0A Review of Reinforcement Learning for Natural Language Processing, and Applications in Healthcare Oct 23, 2023 Decision Making Machine Translation
— Unverified 0Iteratively Learn Diverse Strategies with State Distance Information Oct 23, 2023 Diversity Reinforcement Learning (RL)
— Unverified 0Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes Oct 20, 2023 Decision Making Multi-Task Learning
— Unverified 0SDGym: Low-Code Reinforcement Learning Environments using System Dynamics Models Oct 19, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Using Experience Classification for Training Non-Markovian Tasks Oct 18, 2023 Autonomous Driving Classification
— Unverified 0On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning Oct 18, 2023 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning Oct 18, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Improving Generalization of Alignment with Human Preferences through Group Invariant Learning Oct 18, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Learning to Optimise Climate Sensor Placement using a Transformer Oct 18, 2023 Deep Reinforcement Learning Management
— Unverified 0Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning Oct 18, 2023 Offline RL Quantization
— Unverified 0Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement Learning Oct 18, 2023 Policy Gradient Methods reinforcement-learning
Code Code Available 0