From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning May 21, 2025 Question Answering Reinforcement Learning (RL)
From Scratch to Sketch: Deep Decoupled Hierarchical Reinforcement Learning for Robotic Sketching Agent Aug 9, 2022 Hierarchical Reinforcement Learning reinforcement-learning
ACN-Sim: An Open-Source Simulator for Data-Driven Electric Vehicle Charging Research Dec 4, 2020 OpenAI Gym Reinforcement Learning (RL)
Continuous control with deep reinforcement learning Sep 9, 2015 Action Detection continuous-control
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning May 14, 2020 model Model-based Reinforcement Learning
Content Masked Loss: Human-Like Brush Stroke Planning in a Reinforcement Learning Painting Agent Dec 18, 2020 object-detection Object Detection
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second Jun 13, 2023 GPU Reinforcement Learning (RL)
Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning May 10, 2022 Distributional Reinforcement Learning Position
Gated Hierarchical Attention for Image Captioning Oct 30, 2018 Decoder Image Captioning
Gaussian RAM: Lightweight Image Classification via Stochastic Retina-Inspired Glimpse and Reinforcement Learning Nov 12, 2020 Classification General Classification
Contextualized Rewriting for Text Summarization Jan 31, 2021 Extractive Summarization reinforcement-learning
Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement Learning Jun 11, 2023 Navigate reinforcement-learning
Generalization to New Actions in Reinforcement Learning Nov 3, 2020 reinforcement-learning Reinforcement Learning
Generalize a Small Pre-trained Model to Arbitrarily Large TSP Instances Dec 19, 2020 Graph Sampling Reinforcement Learning (RL)
Constructions in combinatorics via neural networks Apr 29, 2021 reinforcement-learning Reinforcement Learning
Generalizing Across Multi-Objective Reward Functions in Deep Reinforcement Learning Sep 17, 2018 Deep Reinforcement Learning reinforcement-learning
Contention Window Optimization in IEEE 802.11ax Networks with Deep Reinforcement Learning Mar 3, 2020 Deep Reinforcement Learning reinforcement-learning
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning Jun 20, 2020 continuous-control Continuous Control
Generating π-Functional Molecules Using STGG+ with Active Learning Feb 20, 2025 Active Learning reinforcement-learning
Contextualize Me -- The Case for Context in Reinforcement Learning Feb 9, 2022 reinforcement-learning Reinforcement Learning
Continuous Coordination As a Realistic Scenario for Lifelong Learning Mar 4, 2021 Continual Learning Deep Reinforcement Learning
Adversarial Deep Reinforcement Learning in Portfolio Management Aug 29, 2018 Deep Reinforcement Learning Management
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents Oct 15, 2023 In-Context Learning In-Context Reinforcement Learning
Adversarial Deep Reinforcement Learning for Improving the Robustness of Multi-agent Autonomous Driving Policies Dec 22, 2021 Autonomous Driving Deep Reinforcement Learning
A Max-Min Entropy Framework for Reinforcement Learning Jun 19, 2021 Disentanglement reinforcement-learning
Giving Up Control: Neurons as Reinforcement Learning Agents Mar 17, 2020 reinforcement-learning Reinforcement Learning
A Benchmark Environment for Offline Reinforcement Learning in Racing Games Jul 12, 2024 reinforcement-learning Reinforcement Learning
Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning Oct 25, 2021 reinforcement-learning Reinforcement Learning
Constrained Update Projection Approach to Safe Policy Optimization Sep 15, 2022 Reinforcement Learning (RL) Safe Reinforcement Learning
Goal-Guided Transformer-Enabled Reinforcement Learning for Efficient Autonomous Navigation Jan 1, 2023 Autonomous Navigation Decision Making
Accelerating Exploration with Unlabeled Prior Data Nov 9, 2023 Reinforcement Learning (RL)
A Meta-Reinforcement Learning Algorithm for Causal Discovery Jul 18, 2022 Causal Discovery Meta Reinforcement Learning
Constrained Policy Optimization via Bayesian World Models Jan 24, 2022 reinforcement-learning Reinforcement Learning (RL)
Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction Sep 14, 2021 Meta-Learning Pseudo Label
A Benchmark Environment Motivated by Industrial Control Problems Sep 27, 2017 OpenAI Gym Reinforcement Learning
Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning Oct 9, 2020 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Graph Meta-Reinforcement Learning for Transferable Autonomous Mobility-on-Demand Feb 15, 2022 Meta Reinforcement Learning reinforcement-learning
Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand Systems Apr 23, 2021 Decision Making Deep Reinforcement Learning
Constrained Variational Policy Optimization for Safe Reinforcement Learning Jan 28, 2022 reinforcement-learning Reinforcement Learning
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning Jun 10, 2025 Large Language Model reinforcement-learning
Active Exploration for Inverse Reinforcement Learning Jul 18, 2022 reinforcement-learning Reinforcement Learning
GreenLight-Gym: Reinforcement learning benchmark environment for control of greenhouse production systems Oct 6, 2024 Numerical Integration Reinforcement Learning (RL)
Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning Oct 11, 2022 Offline RL reinforcement-learning
Zero-Shot Reinforcement Learning from Low Quality Data Sep 26, 2023 Offline RL reinforcement-learning
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning Sep 29, 2023 Image Generation Offline RL
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining Jan 30, 2023 Offline RL reinforcement-learning
Constrained episodic reinforcement learning in concave-convex and knapsack settings Jun 9, 2020 reinforcement-learning Reinforcement Learning
Constraint-Guided Reinforcement Learning: Augmenting the Agent-Environment-Interaction Apr 24, 2021 reinforcement-learning Reinforcement Learning
Continuous Deep Q-Learning with Model-based Acceleration Mar 2, 2016 continuous-control Continuous Control
Deep Active Inference for Partially Observable MDPs Sep 8, 2020 Deep Reinforcement Learning Q-Learning