Temporal Regularization for Markov Decision Process Dec 1, 2018 Atari Games reinforcement-learning
Simplifying Deep Reinforcement Learning via Self-Supervision Jun 10, 2021 Deep Reinforcement Learning regression
Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation Feb 21, 2024 Multi-Objective Reinforcement Learning Reinforcement Learning (RL)
Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning Sep 22, 2022 Atari Games Atari Games 100k
Temporal Regularization in Markov Decision Process Nov 1, 2018 Atari Games reinforcement-learning
Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems Sep 21, 2020 Decoder Multi-Label Classification
Pretrained Bayesian Non-parametric Knowledge Prior in Robotic Long-Horizon Reinforcement Learning Mar 27, 2025 Reinforcement Learning (RL)
Rethinking the Role of Proxy Rewards in Language Model Alignment Feb 2, 2024 Language Modeling Language Modelling
Reinforcement Learning with Dynamic Boltzmann Softmax Updates Mar 14, 2019 Atari Games Q-Learning
Reinforcement Learning with Deep Energy-Based Policies Feb 27, 2017 Q-Learning reinforcement-learning
Molecular De Novo Design through Deep Reinforcement Learning Apr 25, 2017 Activity Prediction Deep Reinforcement Learning
Reinforcement Learning with Brain-Inspired Modulation can Improve Adaptation to Environmental Changes May 19, 2022 reinforcement-learning Reinforcement Learning (RL)
ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations Oct 26, 2019 Atari Games Deep Reinforcement Learning
Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning Apr 18, 2025 Reinforcement Learning (RL)
Retrospex: Language Agent Meets Offline Reinforcement Learning Critic May 17, 2025 reinforcement-learning Reinforcement Learning
Reinforcement Learning with a Terminator May 30, 2022 Autonomous Driving reinforcement-learning
Sim-to-Real Reinforcement Learning for Deformable Object Manipulation Jun 20, 2018 Deep Reinforcement Learning Deformable Object Manipulation
Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning Oct 5, 2022 Deep Reinforcement Learning Q-Learning
Preferences Implicit in the State of the World Feb 12, 2019 Reinforcement Learning Reinforcement Learning (RL)
Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning Jan 21, 2022 reinforcement-learning Reinforcement Learning (RL)
Reinforcement Learning with Algorithms from Probabilistic Structure Estimation Mar 15, 2021 reinforcement-learning Reinforcement Learning
Towards Safe Policy Improvement for Non-Stationary MDPs Oct 23, 2020 Decision Making reinforcement-learning
TensorFlow Agents: Efficient Batched Reinforcement Learning in TensorFlow Sep 8, 2017 reinforcement-learning Reinforcement Learning
Preference-Guided Reinforcement Learning for Efficient Exploration Jul 9, 2024 Efficient Exploration reinforcement-learning
MOFGPT: Generative Design of Metal-Organic Frameworks using Language Models May 30, 2025 reinforcement-learning Reinforcement Learning
Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems Apr 23, 2024 Reinforcement Learning (RL)
Online Cyber-Attack Detection in Smart Grid: A Reinforcement Learning Approach Sep 14, 2018 Anomaly Detection Cyber Attack Detection
Reinforcement Learning with a Corrupted Reward Channel May 23, 2017 reinforcement-learning Reinforcement Learning
NARS vs. Reinforcement learning: ONA vs. Q-Learning Dec 23, 2022 Q-Learning reinforcement-learning
Integrating Distributed Architectures in Highly Modular RL Libraries Jul 6, 2020 reinforcement-learning Reinforcement Learning (RL)
Myopic Bayesian Design of Experiments via Posterior Sampling and Probabilistic Programming May 25, 2018 Bayesian Inference Multi-Armed Bandits
Preference-based Interactive Multi-Document Summarisation Jun 7, 2019 Active Learning reinforcement-learning
Predictive World Models from Real-World Partial Observations Jan 12, 2023 Continual Learning Open-Ended Question Answering
Simulation-Based Benchmarking of Reinforcement Learning Agents for Personalized Retail Promotions May 16, 2024 Benchmarking Reinforcement Learning (RL)
Simulation-based reinforcement learning for real-world autonomous driving Nov 29, 2019 Autonomous Driving reinforcement-learning
Unified Distributed Environment May 14, 2022 OpenAI Gym reinforcement-learning
Reinforcement Learning with A* and a Deep Heuristic Nov 19, 2018 Q-Learning reinforcement-learning
Simulation of Nanorobots with Artificial Intelligence and Reinforcement Learning for Advanced Cancer Cell Detection and Tracking Nov 4, 2024 Cell Detection Navigate
Revisiting Fundamentals of Experience Replay Jul 13, 2020 Deep Reinforcement Learning DQN Replay Dataset
Towards Sample Efficient Agents through Algorithmic Alignment Aug 7, 2020 Deep Reinforcement Learning Graph Neural Network
Reinforcement Learning When All Actions are Not Always Available Jun 5, 2019 All Decision Making
Reinforcement Learning via Recurrent Convolutional Neural Networks Jan 9, 2017 Deep Reinforcement Learning reinforcement-learning
Predicting Research Trends From Arxiv Mar 7, 2019 reinforcement-learning Reinforcement Learning
Revisiting Prioritized Experience Replay: A Value Perspective Feb 5, 2021 Atari Games Q-Learning
Reinforcement Learning via Auxiliary Task Distillation Jun 24, 2024 Object Rearrangement reinforcement-learning
Online Baum-Welch algorithm for Hierarchical Imitation Learning Mar 22, 2021 Hierarchical Reinforcement Learning Imitation Learning
Simultaneous Double Q-learning with Conservative Advantage Learning for Actor-Critic Methods May 8, 2022 continuous-control Continuous Control
Revisiting State Augmentation methods for Reinforcement Learning with Stochastic Delays Aug 17, 2021 reinforcement-learning Reinforcement Learning
ViZDoom: A Doom-based AI Research Platform for Visual Reinforcement Learning May 6, 2016 Atari Games FPS Games
Towards Scalable Verification of Deep Reinforcement Learning May 25, 2021 Deep Reinforcement Learning reinforcement-learning