Strongly-polynomial time and validation analysis of policy gradient methods Sep 28, 2024 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Structural Credit Assignment in Neural Networks using Reinforcement Learning Dec 1, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Structural Credit Assignment with Coordinated Exploration Jul 25, 2023 Reinforcement Learning (RL)
— Unverified 0Structural Return Maximization for Reinforcement Learning May 12, 2014 Learning Theory reinforcement-learning
— Unverified 0Structural Similarity for Improved Transfer in Reinforcement Learning Jul 27, 2022 Q-Learning reinforcement-learning
— Unverified 0Structure-aware reinforcement learning for node-overload protection in mobile edge computing Jun 29, 2021 Edge-computing reinforcement-learning
— Unverified 0Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning Sep 29, 2021 Multi-Task Learning reinforcement-learning
— Unverified 0Structured Dialogue Policy with Graph Neural Networks Aug 1, 2018 Automatic Speech Recognition (ASR) Decision Making
— Unverified 0Structured Graph Network for Constrained Robot Crowd Navigation with Low Fidelity Simulation May 27, 2024 Reinforcement Learning (RL)
— Unverified 0Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks Apr 25, 2024 Fairness Multi-Armed Bandits
— Unverified 0Structured World Belief for Reinforcement Learning in POMDP Jul 19, 2021 Inductive Bias Object
— Unverified 0Structure-Enhanced Deep Reinforcement Learning for Optimal Transmission Scheduling Nov 20, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Structure in Deep Reinforcement Learning: A Survey and Open Problems Jun 28, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Structure Learning in Human Sequential Decision-Making Dec 1, 2008 Decision Making reinforcement-learning
— Unverified 0Structure Learning in Motor Control:A Deep Reinforcement Learning Model Jun 21, 2017 Deep Reinforcement Learning Model-based Reinforcement Learning
— Unverified 0Student/Teacher Advising through Reward Augmentation Feb 7, 2020 General Reinforcement Learning reinforcement-learning
— Unverified 0Student-Teacher Curriculum Learning via Reinforcement Learning: Predicting Hospital Inpatient Admission Location Jul 1, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy Apr 5, 2020 Dialogue Generation reinforcement-learning
— Unverified 0Sub-domain Modelling for Dialogue Management with Hierarchical Reinforcement Learning Jun 19, 2017 Dialogue Management Hierarchical Reinforcement Learning
— Unverified 0Subgoal-based Reward Shaping to Improve Efficiency in Reinforcement Learning Apr 13, 2021 AI Agent reinforcement-learning
— Unverified 0Subgoal Discovery Using a Free Energy Paradigm and State Aggregations Dec 21, 2024 Reinforcement Learning (RL) Sequential Decision Making
— Unverified 0Sub-Goal Trees -- a Framework for Goal-Based Reinforcement Learning Feb 27, 2020 Motion Planning reinforcement-learning
— Unverified 0Relative Entropy Regularized Policy Iteration Dec 5, 2018 continuous-control Continuous Control
Code Code Available 0Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning Oct 25, 2019 Imitation Learning reinforcement-learning
Code Code Available 0Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation Oct 19, 2019 Data Augmentation Deep Reinforcement Learning
Code Code Available 0Sequential memory improves sample and memory efficiency in Episodic Control Dec 29, 2021 Deep Reinforcement Learning Hippocampus
Code Code Available 0Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target Jan 22, 2019 Deep Reinforcement Learning Q-Learning
Code Code Available 0Proper Value Equivalence Jun 18, 2021 Model-based Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Reliability and Learnability of Human Bandit Feedback for Sequence-to-Sequence Reinforcement Learning May 27, 2018 Machine Translation NMT
Code Code Available 0Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention Apr 4, 2024 Contrastive Learning Multi-Task Learning
Code Code Available 0Meta-Reinforcement Learning via Buffering Graph Signatures for Live Video Streaming Events Oct 3, 2021 Meta-Learning Meta Reinforcement Learning
Code Code Available 0On the Effectiveness of Offline RL for Dialogue Response Generation Jul 23, 2023 Offline RL reinforcement-learning
Code Code Available 0Relational Graph Learning for Crowd Navigation Sep 28, 2019 Deep Reinforcement Learning Graph Learning
Code Code Available 0Relational Deep Reinforcement Learning Jun 5, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models Oct 22, 2021 counterfactual Decision Making
Code Code Available 0Task-Oriented Language Grounding for Language Input with Multiple Sub-Goals of Non-Linear Order Oct 27, 2019 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Task-Oriented Query Reformulation with Reinforcement Learning Apr 15, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0Task Phasing: Automated Curriculum Learning from Demonstrations Oct 20, 2022 Reinforcement Learning (RL)
Code Code Available 0UNSAT Solver Synthesis via Monte Carlo Forest Search Nov 22, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures Jul 1, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Remember and Forget for Experience Replay Jul 16, 2018 Deep Reinforcement Learning Policy Gradient Methods
Code Code Available 0Value Iteration for Learning Concurrently Executable Robotic Control Tasks Apr 1, 2025 Reinforcement Learning (RL)
Code Code Available 0Monolithic vs. hybrid controller for multi-objective Sim-to-Real learning Aug 17, 2021 Reinforcement Learning (RL)
Code Code Available 0Value Iteration Networks Feb 9, 2016 reinforcement-learning Reinforcement Learning
Code Code Available 0Renaissance Robot: Optimal Transport Policy Fusion for Learning Diverse Skills Jul 3, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Propagating Uncertainty in Reinforcement Learning via Wasserstein Barycenters Dec 1, 2019 Atari Games Q-Learning
Code Code Available 0Model-based Offline Policy Optimization with Adversarial Network Sep 5, 2023 model Offline RL
Code Code Available 0Setting up a Reinforcement Learning Task with a Real-World Robot Mar 19, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Monitored Markov Decision Processes Feb 9, 2024 Reinforcement Learning (RL)
Code Code Available 0TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets Dec 5, 2022 D4RL MuJoCo
Code Code Available 0