Learning by Playing - Solving Sparse Reward Tasks from Scratch Feb 28, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Angrier Birds: Bayesian reinforcement learning Jan 6, 2016 Efficient Exploration Q-Learning
Code Code Available 0DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning Sep 15, 2021 Deep Reinforcement Learning Offline RL
Code Code Available 0CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization May 13, 2024 Bayesian Optimization Reinforcement Learning (RL)
Code Code Available 0Implicit Quantile Networks for Distributional Reinforcement Learning Jun 14, 2018 Atari Games Distributional Reinforcement Learning
Code Code Available 0Data Valuation using Reinforcement Learning Sep 25, 2019 Data Valuation Domain Adaptation
Code Code Available 0A Snapshot of Influence: A Local Data Attribution Framework for Online Reinforcement Learning May 25, 2025 Reinforcement Learning (RL)
Code Code Available 0Ask the Right Questions: Active Question Reformulation with Reinforcement Learning May 22, 2017 Information Retrieval Question Answering
Code Code Available 0Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning Mar 2, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy Jul 25, 2022 continuous-control Continuous Control
Code Code Available 0General Policy Evaluation and Improvement by Learning to Identify Few But Crucial States Jul 4, 2022 continuous-control Continuous Control
Code Code Available 0General policy mapping: online continual reinforcement learning inspired on the insect brain Nov 30, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning Sep 25, 2019 Deep Reinforcement Learning motion retargeting
Code Code Available 0Adaptive Risk-Aware Bidding with Budget Constraint in Display Advertising Dec 6, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning May 30, 2022 Data Poisoning Deep Reinforcement Learning
Code Code Available 0Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning Jun 18, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Data sharing games Jan 26, 2021 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems Oct 17, 2022 Reinforcement Learning (RL)
Code Code Available 0Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning Oct 16, 2023 Chatbot Offline RL
Code Code Available 0Importance Prioritized Policy Distillation Aug 25, 2022 Atari Games Decision Making
Code Code Available 0Bridging the Gap in Vision Language Models in Identifying Unsafe Concepts Across Modalities Jul 15, 2025 Reinforcement Learning (RL)
Code Code Available 0Depth Self-Optimized Learning Toward Data Science Nov 2, 2020 Reinforcement Learning (RL)
Code Code Available 0Generating Classical Chinese Poems from Vernacular Chinese Aug 31, 2019 Cultural Vocal Bursts Intensity Prediction Machine Translation
Code Code Available 0Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage Oct 27, 2023 Offline RL Reinforcement Learning (RL)
Code Code Available 0Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning Jan 30, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Efficient Sparse-Reward Goal-Conditioned Reinforcement Learning with a High Replay Ratio and Regularization Dec 10, 2023 Q-Learning Reinforcement Learning (RL)
Code Code Available 0An Evaluation Study of Intrinsic Motivation Techniques applied to Reinforcement Learning over Hard Exploration Environments May 23, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0A neurally plausible model learns successor representations in partially observable environments Jun 22, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition Apr 20, 2019 Deep Reinforcement Learning Reinforcement Learning
Code Code Available 0Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control Jun 20, 2017 Gaussian Processes Model Predictive Control
Code Code Available 0Learning Scheduling Algorithms for Data Processing Clusters Oct 3, 2018 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel Test Environments Jul 13, 2022 Reinforcement Learning (RL)
Code Code Available 0Efficient time stepping for numerical integration using reinforcement learning Apr 8, 2021 Meta-Learning Numerical Integration
Code Code Available 0Efficient Transformer-based Hyper-parameter Optimization for Resource-constrained IoT Environments Mar 18, 2024 Reinforcement Learning (RL)
Code Code Available 0Generating Multi-type Temporal Sequences to Mitigate Class-imbalanced Problem Apr 7, 2021 BIG-bench Machine Learning Click-Through Rate Prediction
Code Code Available 0A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation Aug 21, 2019 Multi-Objective Reinforcement Learning reinforcement-learning
Code Code Available 0Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning Apr 4, 2016 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening Nov 5, 2016 Atari Games Deep Reinforcement Learning
Code Code Available 0Data-Efficient Hierarchical Reinforcement Learning May 21, 2018 Hierarchical Reinforcement Learning reinforcement-learning
Code Code Available 0Bregman Gradient Policy Optimization Jun 23, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning Complex Teamwork Tasks Using a Given Sub-task Decomposition Feb 9, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Data driven approach towards more efficient Newton-Raphson power flow calculation for distribution grids Apr 15, 2025 Reinforcement Learning (RL)
Code Code Available 0Data center cooling using model-predictive control Dec 1, 2018 model Model Predictive Control
Code Code Available 0Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning Jan 1, 2024 Autonomous Vehicles Deep Reinforcement Learning
Code Code Available 0Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models Feb 8, 2025 Conformal Prediction Decision Making
Code Code Available 0Ego-Pose Estimation and Forecasting as Real-Time PD Control Jun 7, 2019 Egocentric Pose Estimation Human Pose Forecasting
Code Code Available 0Adaptive Reward Design for Reinforcement Learning Dec 14, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward May 12, 2025 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 0Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling Jul 12, 2019 Imitation Learning reinforcement-learning
Code Code Available 0BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning Oct 2, 2021 Offline RL reinforcement-learning
Code Code Available 0