Multiagent Copilot Approach for Shared Autonomy between Human EEG and TD3 Deep Reinforcement Learning Dec 22, 2023 Brain Computer Interface Deep Reinforcement Learning
— Unverified 0A Survey of Reinforcement Learning from Human Feedback Dec 22, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Optimizing Heat Alert Issuance with Reinforcement Learning Dec 21, 2023 Data Augmentation Decision Making
Code Code Available 0Maximum entropy GFlowNets with soft Q-learning Dec 21, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles Dec 21, 2023 Autonomous Vehicles Decision Making
— Unverified 0Parameterized Projected Bellman Operator Dec 20, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 0Optimal coordination of resources: A solution from reinforcement learning Dec 20, 2023 Q-Learning reinforcement-learning
— Unverified 0Towards Machines that Trust: AI Agents Learn to Trust in the Trust Game Dec 20, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Neural Network Approximation for Pessimistic Offline Reinforcement Learning Dec 19, 2023 Deep Reinforcement Learning Offline RL
— Unverified 0Stable Relay Learning Optimization Approach for Fast Power System Production Cost Minimization Simulation Dec 19, 2023 Imitation Learning Reinforcement Learning (RL)
— Unverified 0A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments Dec 19, 2023 Reinforcement Learning (RL) Zero-shot Generalization
— Unverified 0CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning Dec 19, 2023 Navigate Offline RL
— Unverified 0Data-Driven Merton's Strategies via Policy Randomization Dec 19, 2023 Reinforcement Learning (RL)
— Unverified 0BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning Dec 19, 2023 Backdoor Attack reinforcement-learning
Code Code Available 0Active search and coverage using point-cloud reinforcement learning Dec 18, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Solving the swing-up and balance task for the Acrobot and Pendubot with SAC Dec 18, 2023 Acrobot Position
— Unverified 0Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis Dec 18, 2023 Bayesian Inference Reinforcement Learning (RL)
— Unverified 0Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints Dec 16, 2023 Decision Making Fairness
— Unverified 0Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning Dec 16, 2023 Autonomous Driving Autonomous Racing
Code Code Available 0Advancing RAN Slicing with Offline Reinforcement Learning Dec 16, 2023 Management Offline RL
— Unverified 0Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing Dec 16, 2023 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning Dec 16, 2023 Reinforcement Learning (RL) Safe Reinforcement Learning
Code Code Available 0Assume-Guarantee Reinforcement Learning Dec 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping Dec 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0ReCoRe: Regularized Contrastive Representation Learning of World Model Dec 14, 2023 Contrastive Learning Denoising
— Unverified 0Towards Automatic Data Augmentation for Disordered Speech Recognition Dec 14, 2023 Data Augmentation Reinforcement Learning (RL)
— Unverified 0iOn-Profiler: intelligent Online multi-objective VNF Profiling with Reinforcement Learning Dec 14, 2023 CPU reinforcement-learning
— Unverified 0An Invitation to Deep Reinforcement Learning Dec 13, 2023 Code Generation Deep Reinforcement Learning
— Unverified 0Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training Time Safety Violations Dec 13, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Toward a Reinforcement-Learning-Based System for Adjusting Medication to Minimize Speech Disfluency Dec 12, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning Dec 12, 2023 Distributional Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Beyond Expected Return: Accounting for Policy Reproducibility when Evaluating Reinforcement Learning Algorithms Dec 12, 2023 Bayesian Optimisation Reinforcement Learning (RL)
— Unverified 0A dynamical clipping approach with task feedback for Proximal Policy Optimization Dec 12, 2023 Language Modelling Large Language Model
Code Code Available 0Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation Dec 12, 2023 Decision Making Language Modelling
— Unverified 0Learning Polynomial Representations of Physical Objects with Application to Certifying Correct Packing Configurations Dec 11, 2023 Object One-Class Classification
— Unverified 0KnowGPT: Knowledge Graph based Prompting for Large Language Models Dec 11, 2023 Knowledge Graphs Prompt Engineering
— Unverified 0Reward Certification for Policy Smoothed Reinforcement Learning Dec 11, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Spreeze: High-Throughput Parallel Reinforcement Learning Framework Dec 11, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Partial End-to-end Reinforcement Learning for Robustness Against Modelling Error in Autonomous Racing Dec 11, 2023 Autonomous Racing Imitation Learning
— Unverified 0Modifying RL Policies with Imagined Actions: How Predictable Policies Can Enable Users to Perform Novel Tasks Dec 10, 2023 Reinforcement Learning (RL)
— Unverified 0Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization Dec 10, 2023 Q-Learning Reinforcement Learning (RL)
Code Code Available 0PerfRL: A Small Language Model Framework for Efficient Code Optimization Dec 9, 2023 Language Modeling Language Modelling
— Unverified 0On the calibration of compartmental epidemiological models Dec 9, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 0Reinforcement Learning-Based Bionic Reflex Control for Anthropomorphic Robotic Grasping exploiting Domain Randomization Dec 8, 2023 Reinforcement Learning (RL) Robotic Grasping
— Unverified 0Modeling Risk in Reinforcement Learning: A Literature Mapping Dec 8, 2023 Management reinforcement-learning
— Unverified 0Guaranteed Trust Region Optimization via Two-Phase KL Penalization Dec 8, 2023 Computational Efficiency Reinforcement Learning (RL)
— Unverified 0Exploring Parity Challenges in Reinforcement Learning through Curriculum Learning with Noisy Labels Dec 8, 2023 Learning with noisy labels Reinforcement Learning (RL)
Code Code Available 0CODEX: A Cluster-Based Method for Explainable Reinforcement Learning Dec 7, 2023 Clustering counterfactual
Code Code Available 0Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning Dec 7, 2023 All Reinforcement Learning (RL)
Code Code Available 0Learning to sample in Cartesian MRI Dec 7, 2023 compressed sensing Computational Efficiency
— Unverified 0