Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits | Feb 7, 2024 | Multi-Armed Bandits, Reinforcement Learning (RL) | Unverified (0)
OIL-AD: An Anomaly Detection Framework for Sequential Decision Sequences | Feb 7, 2024 | Anomaly Detection, Behavioural cloning | Code Available (0)
Learning Diverse Policies with Soft Self-Generated Guidance | Feb 7, 2024 | continuous-control, Continuous Control | Unverified (0)
A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs | Feb 7, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified (0)
Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy | Feb 7, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified (0)
RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback | Feb 6, 2024 | reinforcement-learning, Reinforcement Learning (RL) | Code Available (2)
SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RL, Imitation Learning | Code Available (1)
Averaging n-step Returns Reduces Variance in Reinforcement Learning | Feb 6, 2024 | reinforcement-learning, Reinforcement Learning | Unverified (0)
No-Regret Reinforcement Learning in Smooth MDPs | Feb 6, 2024 | reinforcement-learning, Reinforcement Learning | Unverified (0)
Reinforcement Learning from Bagged Reward | Feb 6, 2024 | reinforcement-learning, Reinforcement Learning | Unverified (0)
Logical Specifications-guided Dynamic Task Sampling for Reinforcement Learning Agents | Feb 6, 2024 | continuous-control, Continuous Control | Code Available (0)
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RL, Offline RL | Code Available (1)
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent | Feb 5, 2024 | Atari Games, Atari Games 100k | Code Available (0)
Abstracted Trajectory Visualization for Explainability in Reinforcement Learning | Feb 5, 2024 | reinforcement-learning, Reinforcement Learning | Unverified (0)
Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences | Feb 5, 2024 | continuous-control, Continuous Control | Unverified (0)
Assessing the Impact of Distribution Shift on Reinforcement Learning Performance | Feb 5, 2024 | reinforcement-learning, Reinforcement Learning | Unverified (0)
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design | Feb 5, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available (0)
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem | Feb 5, 2024 | Montezuma's Revenge, NetHack | Code Available (0)
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning | Feb 5, 2024 | reinforcement-learning, Reinforcement Learning | Code Available (3)
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive Learning, D4RL | Unverified (0)
Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning | Feb 5, 2024 | Multi-Objective Reinforcement Learning, reinforcement-learning | Unverified (0)
Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence | Feb 5, 2024 | continuous-control, Continuous Control | Unverified (0)
Vision-Language Models Provide Promptable Representations for Reinforcement Learning | Feb 5, 2024 | Common Sense Reasoning, Instruction Following | Unverified (0)
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate | Feb 5, 2024 | Image Classification, Language Modelling | Unverified (0)
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays | Feb 5, 2024 | reinforcement-learning, Reinforcement Learning | Code Available (0)
Replication of Impedance Identification Experiments on a Reinforcement-Learning-Controlled Digital Twin of Human Elbows | Feb 5, 2024 | Reinforcement Learning (RL) | Code Available (0)
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RL, Data Augmentation | Unverified (0)
The Virtues of Pessimism in Inverse Reinforcement Learning | Feb 4, 2024 | Offline RL, reinforcement-learning | Unverified (0)
Evading Deep Learning-Based Malware Detectors via Obfuscation: A Deep Reinforcement Learning Approach | Feb 4, 2024 | Deep Reinforcement Learning, Malware Detection | Unverified (0)
A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control | Feb 4, 2024 | Bayesian Optimization, Deep Reinforcement Learning | Unverified (0)
Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning | Feb 3, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified (0)
A Survey of Constraint Formulations in Safe Reinforcement Learning | Feb 3, 2024 | Diversity, reinforcement-learning | Unverified (0)
Rethinking the Role of Proxy Rewards in Language Model Alignment | Feb 2, 2024 | Language Modeling, Language Modelling | Code Available (0)
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models | Feb 2, 2024 | Reinforcement Learning (RL) | Unverified (0)
The Political Preferences of LLMs | Feb 2, 2024 | Reinforcement Learning (RL) | Unverified (0)
An Auction-based Marketplace for Model Trading in Federated Learning | Feb 2, 2024 | Federated Learning, Marketing | Unverified (0)
To the Max: Reinventing Reward in Reinforcement Learning | Feb 2, 2024 | reinforcement-learning, Reinforcement Learning | Code Available (0)
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback | Feb 2, 2024 | Code Completion, Code Generation | Code Available (2)
Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems | Feb 2, 2024 | reinforcement-learning, Reinforcement Learning | Unverified (0)
Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning | Feb 1, 2024 | Imitation Learning, MuJoCo | Code Available (0)
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Feb 1, 2024 | Imitation Learning, Offline RL | Code Available (1)
Towards Efficient Exact Optimization of Language Model Alignment | Feb 1, 2024 | Language Modeling, Language Modelling | Code Available (2)
Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management | Feb 1, 2024 | Deep Reinforcement Learning, Management | Code Available (2)
Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments | Feb 1, 2024 | Reinforcement Learning (RL), Safe Reinforcement Learning | Code Available (0)
Safe Reinforcement Learning-Based Eco-Driving Control for Mixed Traffic Flows With Disturbances | Jan 31, 2024 | Reinforcement Learning (RL), Safe Reinforcement Learning | Unverified (0)
A Reinforcement Learning Based Controller to Minimize Forces on the Crutches of a Lower-Limb Exoskeleton | Jan 31, 2024 | Deep Reinforcement Learning, MuJoCo | Unverified (0)
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees | Jan 31, 2024 | Reinforcement Learning (RL) | Code Available (0)
Causal Coordinated Concurrent Reinforcement Learning | Jan 31, 2024 | Causal Inference, reinforcement-learning | Unverified (0)
Attention Graph for Multi-Robot Social Navigation with Deep Reinforcement Learning | Jan 31, 2024 | Deep Reinforcement Learning, Graph Neural Network | Unverified (0)
Zero-Shot Reinforcement Learning via Function Encoders | Jan 30, 2024 | Decision Making, reinforcement-learning | Code Available (0)