Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation Jul 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets Jul 15, 2023 Drug Discovery Reinforcement Learning (RL)
— Unverified 0Combining model-predictive control and predictive reinforcement learning for stable quadrupedal robot locomotion Jul 15, 2023 Model Predictive Control reinforcement-learning
— Unverified 0Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty Jul 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0SafeDreamer: Safe Reinforcement Learning with World Models Jul 14, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative Jul 13, 2023 Reinforcement Learning (RL)
— Unverified 0Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning Jul 13, 2023 Benchmarking Offline RL
Code Code Available 1PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks Jul 12, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1Transformers in Reinforcement Learning: A Survey Jul 12, 2023 Cloud Computing Combinatorial Optimization
— Unverified 0Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior Jul 12, 2023 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Payload-Independent Direct Cost Learning for Image Steganography Jul 11, 2023 Image Steganography Reinforcement Learning (RL)
Code Code Available 1Empowering recommender systems using automatically generated Knowledge Graphs and Reinforcement Learning Jul 11, 2023 Decision Making Knowledge Graphs
Code Code Available 0Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing Jul 11, 2023 Lifelong learning OpenAI Gym
— Unverified 0Probabilistic Counterexample Guidance for Safer Reinforcement Learning (Extended Version) Jul 10, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation Jul 10, 2023 Decision Making Interactive Recommendation
Code Code Available 1Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning Jul 10, 2023 continuous-control Continuous Control
— Unverified 0RLTF: Reinforcement Learning from Unit Test Feedback Jul 10, 2023 Code Generation mbpp
Code Code Available 1Investigating the Edge of Stability Phenomenon in Reinforcement Learning Jul 9, 2023 Q-Learning reinforcement-learning
— Unverified 0A User Study on Explainable Online Reinforcement Learning for Adaptive Systems Jul 9, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Active Collection of Well-Being and Health Data in Mobile Devices Jul 7, 2023 Q-Learning Reinforcement Learning (RL)
Code Code Available 0Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning Jul 7, 2023 Contrastive Learning reinforcement-learning
Code Code Available 1When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment Jul 7, 2023 Reinforcement Learning (RL)
Code Code Available 2Learning Multi-Agent Intention-Aware Communication for Optimal Multi-Order Execution in Finance Jul 6, 2023 Reinforcement Learning (RL)
— Unverified 0Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback Jul 6, 2023 Decision Making LEMMA
— Unverified 0Offline Reinforcement Learning with Imbalanced Datasets Jul 6, 2023 D4RL Offline RL
— Unverified 0A Neuromorphic Architecture for Reinforcement Learning from Real-Valued Observations Jul 6, 2023 Acrobot Decision Making
— Unverified 0Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning Jul 5, 2023 OpenAI Gym reinforcement-learning
Code Code Available 0Generative Job Recommendations with Large Language Model Jul 5, 2023 Collaborative Filtering Language Modeling
— Unverified 0First-Explore, then Exploit: Meta-Learning to Solve Hard Exploration-Exploitation Trade-Offs Jul 5, 2023 Meta-Learning Reinforcement Learning (RL)
Code Code Available 1LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning Jul 5, 2023 Offline RL Q-Learning
— Unverified 0A Scalable Reinforcement Learning-based System Using On-Chain Data for Cryptocurrency Portfolio Management Jul 4, 2023 Management Reinforcement Learning (RL)
— Unverified 0Environmental effects on emergent strategy in micro-scale multi-agent reinforcement learning Jul 3, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning Jul 1, 2023 D4RL model
Code Code Available 1Decentralized Motor Skill Learning for Complex Robotic Systems Jun 30, 2023 Reinforcement Learning (RL)
— Unverified 0Navigation of micro-robot swarms for targeted delivery using reinforcement learning Jun 30, 2023 Navigate reinforcement-learning
— Unverified 0Comparing Reinforcement Learning and Human Learning using the Game of Hidden Rules Jun 30, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0ArrayBot: Reinforcement Learning for Generalizable Distributed Manipulation through Touch Jun 29, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Probabilistic Constraint for Safety-Critical Reinforcement Learning Jun 29, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Safety-Aware Task Composition for Discrete and Continuous Reinforcement Learning Jun 29, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark Jun 29, 2023 Combinatorial Optimization Computational Efficiency
Code Code Available 4SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores Jun 29, 2023 CPU reinforcement-learning
Code Code Available 1Laxity-Aware Scalable Reinforcement Learning for HVAC Control Jun 29, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning Jun 29, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards Jun 28, 2023 FetchPush-v1 Multi-Goal Reinforcement Learning
Code Code Available 1Structure in Deep Reinforcement Learning: A Survey and Open Problems Jun 28, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes Jun 28, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning Jun 28, 2023 Autonomous Driving Autonomous Vehicles
— Unverified 0Learning to Sail Dynamic Networks: The MARLIN Reinforcement Learning Framework for Congestion Control in Tactical Environments Jun 27, 2023 Reinforcement Learning (RL)
— Unverified 0Automatic Truss Design with Reinforcement Learning Jun 27, 2023 Combinatorial Optimization Layout Design
Code Code Available 1Machine-learning based noise characterization and correction on neutral atoms NISQ devices Jun 27, 2023 Reinforcement Learning (RL)
— Unverified 0