A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning | May 29, 2024 | Continual Learning, Deep Reinforcement Learning | Code Available | 0
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning | May 29, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF | May 29, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies | May 29, 2024 | Metric Learning, Off-policy evaluation | Code Available | 0
Large Language Model-Driven Curriculum Design for Mobile Networks | May 28, 2024 | Language Modeling, Language Modelling | Code Available | 0
Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment | May 28, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0
LeDex: Training LLMs to Better Self-Debug and Explain Code | May 28, 2024 | Code Generation, Reinforcement Learning (RL) | Unverified | 0
Extreme Value Monte Carlo Tree Search | May 28, 2024 | Board Games, Reinforcement Learning (RL) | Unverified | 0
Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding | May 28, 2024 | reinforcement-learning, Reinforcement Learning (RL) | Code Available | 0
Rethinking Pruning for Backdoor Mitigation: An Optimization Perspective | May 28, 2024 | backdoor defense, Graph Neural Network | Unverified | 0
Highway Reinforcement Learning | May 28, 2024 | Q-Learning, reinforcement-learning | Unverified | 0
Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression | May 28, 2024 | Imitation Learning, MuJoCo | Code Available | 0
Mollification Effects of Policy Gradient Methods | May 28, 2024 | continuous-control, Continuous Control | Unverified | 0
Structured Graph Network for Constrained Robot Crowd Navigation with Low Fidelity Simulation | May 27, 2024 | Reinforcement Learning (RL) | Unverified | 0
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning | May 27, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments | May 27, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0
Oracle-Efficient Reinforcement Learning for Max Value Ensembles | May 27, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales | May 27, 2024 | Atari Games, MuJoCo | Code Available | 0
Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear q^π-Realizability and Concentrability | May 27, 2024 | Computational Efficiency, Offline RL | Unverified | 0
Biological Neurons Compete with Deep Reinforcement Learning in Sample Efficiency in a Simulated Gameworld | May 27, 2024 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning | May 26, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning | May 26, 2024 | Multi-Objective Reinforcement Learning, reinforcement-learning | Code Available | 0
Competing for pixels: a self-play algorithm for weakly-supervised segmentation | May 26, 2024 | Binary Classification, Image Segmentation | Code Available | 0
Reinforcement Learning for Jump-Diffusions, with Financial Applications | May 26, 2024 | Q-Learning, reinforcement-learning | Unverified | 0
An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS | May 26, 2024 | Decision Making, Q-Learning | Unverified | 0
Adaptive Q-Network: On-the-fly Target Selection for Deep Reinforcement Learning | May 25, 2024 | Atari Games, AutoML | Unverified | 0
AIGB: Generative Auto-bidding via Conditional Diffusion Modeling | May 25, 2024 | Reinforcement Learning (RL) | Unverified | 0
Constrained Ensemble Exploration for Unsupervised Skill Discovery | May 25, 2024 | Reinforcement Learning (RL), Unsupervised Reinforcement Learning | Unverified | 0
Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments | May 24, 2024 | Data Augmentation, Reinforcement Learning (RL) | Unverified | 0
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning | May 24, 2024 | Deep Reinforcement Learning, Q-Learning | Unverified | 0
Embedding-Aligned Language Models | May 24, 2024 | Reinforcement Learning (RL), Text Generation | Unverified | 0
Extracting Heuristics from Large Language Models for Reward Shaping in Reinforcement Learning | May 24, 2024 | Language Modelling, Large Language Model | Unverified | 0
Cooperative Backdoor Attack in Decentralized Reinforcement Learning with Theoretical Guarantee | May 24, 2024 | Backdoor Attack, reinforcement-learning | Unverified | 0
TrojanForge: Generating Adversarial Hardware Trojan Examples Using Reinforcement Learning | May 24, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine | May 24, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0
Model-free reinforcement learning with noisy actions for automated experimental control in optics | May 24, 2024 | Reinforcement Learning (RL) | Code Available | 0
Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | May 23, 2024 | continuous-control, Continuous Control | Code Available | 0
Efficiently Training Deep-Learning Parametric Policies using Lagrangian Duality | May 23, 2024 | Decision Making, Decision Making Under Uncertainty | Unverified | 0
Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences | May 23, 2024 | Reinforcement Learning (RL) | Code Available | 0
Variational Delayed Policy Optimization | May 23, 2024 | MuJoCo, Reinforcement Learning (RL) | Code Available | 0
Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RL, Q-Learning | Unverified | 0
A finite time analysis of distributed Q-learning | May 23, 2024 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence | May 23, 2024 | Distributional Reinforcement Learning, Policy Gradient Methods | Unverified | 0
Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural Networks | May 23, 2024 | counterfactual, Counterfactual Inference | Unverified | 0
Large Language Models (LLMs) Assisted Wireless Network Deployment in Urban Settings | May 22, 2024 | Navigate, Reinforcement Learning (RL) | Unverified | 0
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention | May 22, 2024 | Autonomous Driving, Autonomous Vehicles | Unverified | 0
Learning to sample fibers for goodness-of-fit testing | May 22, 2024 | Reinforcement Learning (RL) | Unverified | 0
Lusifer: LLM-based User SImulated Feedback Environment for online Recommender systems | May 22, 2024 | Collaborative Filtering, Recommendation Systems | Code Available | 0
Leader Reward for POMO-Based Neural Combinatorial Optimization | May 22, 2024 | Combinatorial Optimization, Reinforcement Learning (RL) | Unverified | 0
HighwayLLM: Decision-Making and Navigation in Highway Driving with RL-Informed Language Model | May 22, 2024 | Autonomous Driving, Autonomous Vehicles | Unverified | 0