Monitored Markov Decision Processes Feb 9, 2024 Reinforcement Learning (RL)
Code Code Available 0Value function interference and greedy action selection in value-based multi-objective reinforcement learning Feb 9, 2024 Multi-Objective Reinforcement Learning Q-Learning
— Unverified 0ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies Feb 9, 2024 counterfactual Counterfactual Reasoning
— Unverified 0High-Precision Geosteering via Reinforcement Learning and Particle Filters Feb 9, 2024 Decision Making reinforcement-learning
— Unverified 0Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains Feb 9, 2024 Depth Estimation MuJoCo
— Unverified 0Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices Feb 8, 2024 Federated Learning Offline RL
— Unverified 0Differentially Private Deep Model-Based Reinforcement Learning Feb 8, 2024 continuous-control Continuous Control
— Unverified 0Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization Feb 8, 2024 Q-Learning reinforcement-learning
Code Code Available 0Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL Feb 8, 2024 Computational Efficiency Reinforcement Learning (RL)
Code Code Available 0Scaling Intelligent Agents in Combat Simulations for Wargaming Feb 8, 2024 Deep Reinforcement Learning Hierarchical Reinforcement Learning
— Unverified 0Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning Feb 8, 2024 Deep Reinforcement Learning Offline RL
— Unverified 0OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences Feb 7, 2024 Anomaly Detection Behavioural cloning
Code Code Available 0Convergence for Natural Policy Gradient on Infinite-State Queueing MDPs Feb 7, 2024 Reinforcement Learning (RL)
— Unverified 0Code as Reward: Empowering Reinforcement Learning with VLMs Feb 7, 2024 Code Generation reinforcement-learning
— Unverified 0Learning Diverse Policies with Soft Self-Generated Guidance Feb 7, 2024 continuous-control Continuous Control
— Unverified 0Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy Feb 7, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits Feb 7, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs Feb 7, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Averaging n-step Returns Reduces Variance in Reinforcement Learning Feb 6, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents Feb 6, 2024 continuous-control Continuous Control
Code Code Available 0No-Regret Reinforcement Learning in Smooth MDPs Feb 6, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning from Bagged Reward Feb 6, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence Feb 5, 2024 continuous-control Continuous Control
— Unverified 0Replication of Impedance Identification Experiments on a Reinforcement-Learning-Controlled Digital Twin of Human Elbows Feb 5, 2024 Reinforcement Learning (RL)
Code Code Available 0Vision-Language Models Provide Promptable Representations for Reinforcement Learning Feb 5, 2024 Common Sense Reasoning Instruction Following
— Unverified 0Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning Feb 5, 2024 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design Feb 5, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning Feb 5, 2024 Contrastive Learning D4RL
— Unverified 0Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences Feb 5, 2024 continuous-control Continuous Control
— Unverified 0Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays Feb 5, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent Feb 5, 2024 Atari Games Atari Games 100k
Code Code Available 0Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem Feb 5, 2024 Montezuma's Revenge NetHack
Code Code Available 0Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate Feb 5, 2024 Image Classification Language Modelling
— Unverified 0Abstracted Trajectory Visualization for Explainability in Reinforcement Learning Feb 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Assessing the Impact of Distribution Shift on Reinforcement Learning Performance Feb 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach Feb 4, 2024 Deep Reinforcement Learning Malware Detection
— Unverified 0DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching Feb 4, 2024 D4RL Data Augmentation
— Unverified 0A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control Feb 4, 2024 Bayesian Optimization Deep Reinforcement Learning
— Unverified 0The Virtues of Pessimism in Inverse Reinforcement Learning Feb 4, 2024 Offline RL reinforcement-learning
— Unverified 0Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning Feb 3, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0A Survey of Constraint Formulations in Safe Reinforcement Learning Feb 3, 2024 Diversity reinforcement-learning
— Unverified 0An Auction-based Marketplace for Model Trading in Federated Learning Feb 2, 2024 Federated Learning Marketing
— Unverified 0Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems Feb 2, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Rethinking the Role of Proxy Rewards in Language Model Alignment Feb 2, 2024 Language Modeling Language Modelling
Code Code Available 0To the Max: Reinventing Reward in Reinforcement Learning Feb 2, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models Feb 2, 2024 Reinforcement Learning (RL)
— Unverified 0The Political Preferences of LLMs Feb 2, 2024 Reinforcement Learning (RL)
— Unverified 0Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments Feb 1, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
Code Code Available 0Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning Feb 1, 2024 Imitation Learning MuJoCo
Code Code Available 0Causal Coordinated Concurrent Reinforcement Learning Jan 31, 2024 Causal Inference reinforcement-learning
— Unverified 0