Safe Reinforcement Learning in Black-Box Environments via Adaptive Shielding May 28, 2024 reinforcement-learning Reinforcement Learning (RL)
RecoGym: A Reinforcement Learning Environment for the problem of Product Recommendation in Online Advertising Aug 2, 2018 Product Recommendation Recommendation Systems
Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning Dec 23, 2019 Efficient Exploration reinforcement-learning
Safe Reinforcement Learning of Control-Affine Systems with Vertex Networks Mar 20, 2020 reinforcement-learning Reinforcement Learning
StepCountJITAI: simulation environment for RL with application to physical activity adaptive intervention Nov 1, 2024 Reinforcement Learning (RL)
Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients Sep 24, 2021 continuous-control Continuous Control
Noisy Natural Gradient as Variational Inference Dec 6, 2017 Active Learning Efficient Exploration
ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement Learning Mar 21, 2022 Deep Reinforcement Learning feature selection
Safe Reinforcement Learning Using Black-Box Reachability Analysis Apr 15, 2022 Motion Planning reinforcement-learning
StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization May 21, 2025 Question Answering Reinforcement Learning (RL)
Noise-Resilient Symbolic Regression with Dynamic Gating Reinforcement Learning Jan 2, 2025 regression reinforcement-learning
Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification Oct 13, 2021 Deep Reinforcement Learning Object
reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use Feb 27, 2024 Reinforcement Learning (RL)
Safe Reinforcement Learning with Nonlinear Dynamics via Model Predictive Shielding May 25, 2019 reinforcement-learning Reinforcement Learning
Safe Reinforcement Learning via Probabilistic Logic Shields Mar 6, 2023 reinforcement-learning Reinforcement Learning
STL-Based Synthesis of Feedback Controllers Using Reinforcement Learning Dec 2, 2022 continuous-control Continuous Control
Safe Reinforcement Learning via Shielding Aug 29, 2017 reinforcement-learning Reinforcement Learning
Parameter-Based Value Functions Jun 16, 2020 continuous-control Continuous Control
VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization May 25, 2025 Reinforcement Learning (RL)
Stochastic Answer Networks for Machine Reading Comprehension Dec 10, 2017 Machine Reading Comprehension Question Answering
Time-R1: Towards Comprehensive Temporal Reasoning in LLMs May 16, 2025 Question Answering Reinforcement Learning (RL)
Missingness as Stability: Understanding the Structure of Missingness in Longitudinal EHR data and its Impact on Reinforcement Learning in Healthcare Nov 16, 2019 Imputation reinforcement-learning
Newton-type Methods for Minimax Optimization Jun 25, 2020 Reinforcement Learning (RL) Vocal Bursts Type Prediction
Newsvendor Model with Deep Reinforcement Learning Dec 22, 2021 Deep Reinforcement Learning model
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables Sep 20, 2019 continuous-control Continuous Control
Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets May 13, 2022 reinforcement-learning Reinforcement Learning (RL)
Mirror Descent Search and its Acceleration Sep 8, 2017 reinforcement-learning Reinforcement Learning
Urban Driving with Multi-Objective Deep Reinforcement Learning Nov 21, 2018 Autonomous Driving Deep Reinforcement Learning
Neuro-symbolic Natural Logic with Introspective Revision for Natural Language Inference Mar 9, 2022 Natural Language Inference reinforcement-learning
Safe Reinforcement Learning with Scene Decomposition for Navigating Complex Urban Environments Apr 25, 2019 Decision Making Navigate
Marginal Policy Gradients: A Unified Family of Estimators for Bounded Action Spaces with Applications Jun 13, 2018 continuous-control Continuous Control
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model Jul 1, 2019 continuous-control Continuous Control
TinyQMIX: Distributed Access Control for mMTC via Multi-agent Reinforcement Learning Nov 21, 2022 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Safer Reinforcement Learning through Transferable Instinct Networks Jul 14, 2021 reinforcement-learning Reinforcement Learning
Neuro-Symbolic Approaches for Text-Based Policy Learning Nov 1, 2021 Reinforcement Learning (RL) text-based games
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models Apr 3, 2025 GSM8K Reinforcement Learning (RL)
Stochastic Neural Networks for Hierarchical Reinforcement Learning Apr 10, 2017 Deep Reinforcement Learning Hierarchical Reinforcement Learning
Stochastic optimal well control in subsurface reservoirs using reinforcement learning Jul 7, 2022 Management reinforcement-learning
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models Sep 4, 2023 Reinforcement Learning (RL) Transfer Learning
Neuronal Circuit Policies Mar 22, 2018 Reinforcement Learning Reinforcement Learning (RL)
Neurogenetic Programming Framework for Explainable Reinforcement Learning Feb 8, 2021 OpenAI Gym reinforcement-learning
Multi-Agent Reinforcement Learning for Visibility-based Persistent Monitoring Nov 2, 2020 Graph Attention Multi-agent Reinforcement Learning
TreeC: a method to generate interpretable energy management systems using a metaheuristic algorithm Apr 17, 2023 energy management Management
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning Oct 31, 2017 Atari Games Deep Reinforcement Learning
Multiagent Reinforcement Learning based Energy Beamforming Control Jun 15, 2020 reinforcement-learning Reinforcement Learning
Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches Jul 25, 2018 Multi-agent Reinforcement Learning reinforcement-learning
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator Mar 5, 2021 Offline RL reinforcement-learning
Model-free reinforcement learning with noisy actions for automated experimental control in optics May 24, 2024 Reinforcement Learning (RL)
Reasoning and Generalization in RL: A Tool Use Perspective Jul 3, 2019 reinforcement-learning Reinforcement Learning
Reasoning about Counterfactuals to Improve Human Inverse Reinforcement Learning Mar 3, 2022 counterfactual Counterfactual Reasoning