Concrete Dropout May 22, 2017 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Discrete Action On-Policy Learning with Action-Value Critic Feb 10, 2020 OpenAI Gym Reinforcement Learning
Code Code Available 0Discrete and Continuous Action Representation for Practical RL in Video Games Dec 23, 2019 Control with Prametrised Actions Reinforcement Learning
Code Code Available 0Deep reinforcement learning from human preferences Jun 12, 2017 Atari Games Deep Reinforcement Learning
Code Code Available 0Hindsight Learning for MDPs with Exogenous Inputs Jul 13, 2022 counterfactual Decision Making
Code Code Available 0Hindsight policy gradients Nov 16, 2017 Policy Gradient Methods reinforcement-learning
Code Code Available 0Learning to Perform Local Rewriting for Combinatorial Optimization Sep 30, 2018 Combinatorial Optimization Reinforcement Learning
Code Code Available 0Feature-Attending Recurrent Modules for Generalization in Reinforcement Learning Dec 15, 2021 Object reinforcement-learning
Code Code Available 0Action Advising with Advice Imitation in Deep Reinforcement Learning Apr 17, 2021 Atari Games Behavioural cloning
Code Code Available 0Logic-based Reward Shaping for Multi-Agent Reinforcement Learning Jun 17, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Discrete State-Action Abstraction via the Successor Representation Jun 7, 2022 Reinforcement Learning (RL) Transfer Learning
Code Code Available 0Hindsight Trust Region Policy Optimization Jul 29, 2019 Atari Games Policy Gradient Methods
Code Code Available 0Discrete-to-Deep Supervised Policy Learning May 5, 2020 Reinforcement Learning (RL)
Code Code Available 0Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning May 18, 2017 Hierarchical Reinforcement Learning Montezuma's Revenge
Code Code Available 0Deep Reinforcement Learning from Hierarchical Preference Design Sep 6, 2023 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0H_ Model-free Reinforcement Learning with Robust Stability Guarantee Nov 7, 2019 Autonomous Driving reinforcement-learning
Code Code Available 0Deep Reinforcement Learning framework for Autonomous Driving Apr 8, 2017 Atari Games Autonomous Driving
Code Code Available 0Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods Feb 28, 2018 Deep Reinforcement Learning Diversity
Code Code Available 0Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning May 27, 2019 Decision Making Deep Reinforcement Learning
Code Code Available 0Hint assisted reinforcement learning: an application in radio astronomy Jan 10, 2023 Astronomy Model-based Reinforcement Learning
Code Code Available 0Disentangled (Un)Controllable Features Oct 31, 2022 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Learning robust control for LQR systems with multiplicative noise via policy gradient May 28, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Disentangling Abstraction from Statistical Pattern Matching in Human and Machine Learning Apr 4, 2022 BIG-bench Machine Learning Inductive Bias
Code Code Available 0Automatic Goal Generation for Reinforcement Learning Agents May 17, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 0Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning Dec 22, 2017 Deep Reinforcement Learning Efficient Exploration
Code Code Available 0ComSD: Balancing Behavioral Quality and Diversity in Unsupervised Skill Discovery Sep 29, 2023 Contrastive Learning Diversity
Code Code Available 0APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight May 20, 2025 Causal Inference Decision Making
Code Code Available 0Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning Jan 27, 2020 Computational Efficiency Decision Making
Code Code Available 0Automatic Discovery of Interpretable Planning Strategies May 24, 2020 Clustering Decision Making
Code Code Available 0Aligning an optical interferometer with beam divergence control and continuous action space Jul 9, 2021 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Language Model Alignment with Elastic Reset Dec 6, 2023 Chatbot Language Modeling
Code Code Available 0A Lightweight Calibrated Simulation Enabling Efficient Offline Learning for Optimal Control of Real Buildings Oct 12, 2023 Reinforcement Learning (RL)
Code Code Available 0Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees Jul 10, 2018 continuous-control Continuous Control
Code Code Available 0Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward Dec 29, 2017 Decision Making Deep Reinforcement Learning
Code Code Available 0Automatically Exposing Problems with Neural Dialog Models Sep 14, 2021 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Dissecting Long Reasoning Models: An Empirical Study Jun 5, 2025 Reinforcement Learning (RL)
Code Code Available 0HOList: An Environment for Machine Learning of Higher-Order Theorem Proving Apr 5, 2019 Automated Theorem Proving BIG-bench Machine Learning
Code Code Available 0A learning gap between neuroscience and reinforcement learning Apr 22, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Deep Reinforcement Learning for Traffic Light Control in Vehicular Networks Mar 29, 2018 Deep Reinforcement Learning Q-Learning
Code Code Available 0Distance Weighted Supervised Learning for Offline Interaction Data Apr 26, 2023 Decision Making Imitation Learning
Code Code Available 0Distantly Supervised NER with Partial Annotation Learning and Reinforcement Learning Aug 1, 2018 Chinese Named Entity Recognition named-entity-recognition
Code Code Available 0APES: a Python toolbox for simulating reinforcement learning environments Aug 31, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Distillation of RL Policies with Formal Guarantees via Variational Abstraction of Markov Decision Processes (Technical Report) Dec 17, 2021 Reinforcement Learning (RL)
Code Code Available 0Homogenization of Multi-agent Learning Dynamics in Finite-state Markov Games Jun 26, 2025 Reinforcement Learning (RL)
Code Code Available 0Intelligent Traffic Light via Policy-based Deep Reinforcement Learning Dec 27, 2021 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Automated quantum programming via reinforcement learning for combinatorial optimization Aug 21, 2019 Combinatorial Optimization reinforcement-learning
Code Code Available 0Language Understanding for Text-based Games Using Deep Reinforcement Learning Jun 30, 2015 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Intelligent Trainer for Model-Based Reinforcement Learning May 24, 2018 model Model-based Reinforcement Learning
Code Code Available 0Deep reinforcement learning for time series: playing idealized trading games Mar 11, 2018 Deep Reinforcement Learning Q-Learning
Code Code Available 0Automated Proof of Polynomial Inequalities via Reinforcement Learning Mar 9, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 0