Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi Mar 22, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0Benchmarking Reinforcement Learning Algorithms on Real-World Robots Sep 20, 2018 Benchmarking continuous-control
Code Code Available 0Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning Jun 6, 2025 Reinforcement Learning (RL)
Code Code Available 0Learn to Steer through Deep Reinforcement Learning Oct 27, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0An Efficient Application of Neuroevolution for Competitive Multiagent Learning May 23, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 0Grammars and reinforcement learning for molecule optimization Nov 27, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Improving Policy Learning via Language Dynamics Distillation Sep 30, 2022 NetHack Reinforcement Learning (RL)
Code Code Available 0Correcting Momentum in Temporal Difference Learning Jun 7, 2021 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Corpus-Level End-to-End Exploration for Interactive Systems Nov 23, 2019 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0Improving Policy Optimization with Generalist-Specialist Learning Jun 26, 2022 Deep Reinforcement Learning Imitation Learning
Code Code Available 0CO-PILOT: COllaborative Planning and reInforcement Learning On sub-Task curriculum Dec 1, 2021 continuous-control Continuous Control
Code Code Available 0Improving Portfolio Optimization Results with Bandit Networks Oct 5, 2024 Portfolio Optimization Recommendation Systems
Code Code Available 0Improving Post-Processing of Audio Event Detectors Using Reinforcement Learning Aug 19, 2022 Classification reinforcement-learning
Code Code Available 0COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks Mar 16, 2022 Offline RL reinforcement-learning
Code Code Available 0Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control Nov 12, 2021 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Graph Backup: Data Efficient Backup Exploiting Markovian Transitions May 31, 2022 Atari Games counterfactual
Code Code Available 0Environment Design for Inverse Reinforcement Learning Oct 26, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 0Improving reinforcement learning algorithms: towards optimal learning rate policies Nov 6, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Improving Reinforcement Learning Based Image Captioning with Natural Language Prior Sep 13, 2018 Image Captioning reinforcement-learning
Code Code Available 0Environment Probing Interaction Policies Jul 26, 2019 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0RH-Net: Improving Neural Relation Extraction via Reinforcement Learning and Hierarchical Relational Searching Oct 27, 2020 Denoising reinforcement-learning
Code Code Available 0Environments for Lifelong Reinforcement Learning Nov 26, 2018 reinforcement-learning Reinforcement Learning
Code Code Available 0Benchmarking Quantum Reinforcement Learning Jan 27, 2025 Benchmarking reinforcement-learning
Code Code Available 0Benchmarking MOEAs for solving continuous multi-objective RL problems May 19, 2025 Benchmarking Evolutionary Algorithms
Code Code Available 0Cooperative Inverse Reinforcement Learning Jun 9, 2016 Active Learning reinforcement-learning
Code Code Available 0Improving Generalization in Reinforcement Learning Training Regimes for Social Robot Navigation Aug 29, 2023 Decision Making Navigate
Code Code Available 0Cooperation-Aware Reinforcement Learning for Merging in Dense Traffic Jun 26, 2019 Autonomous Vehicles Decision Making
Code Code Available 0Graph Convolutional Reinforcement Learning Oct 22, 2018 Decision Making reinforcement-learning
Code Code Available 0Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions May 20, 2022 Reinforcement Learning (RL)
Code Code Available 0Benchmarking Model-Based Reinforcement Learning Jul 3, 2019 Benchmarking model
Code Code Available 0Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness Oct 28, 2023 Benchmarking image-classification
Code Code Available 0ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback Jun 25, 2024 Reinforcement Learning (RL) Sentence
Code Code Available 0Convolutional Reservoir Computing for World Models Jul 18, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0A framework for reinforcement learning with autocorrelated actions Sep 10, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning to reinforcement learn Nov 17, 2016 Deep Reinforcement Learning Meta-Learning
Code Code Available 0Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes Jan 29, 2022 Decision Making Model-based Reinforcement Learning
Code Code Available 0GraphNAS: Graph Neural Architecture Search with Reinforcement Learning Apr 22, 2019 General Classification Inductive Learning
Code Code Available 0Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network Apr 7, 2021 Adversarial Attack Deep Reinforcement Learning
Code Code Available 0Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards Aug 18, 2020 Deep Reinforcement Learning Fairness
Code Code Available 0Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations Mar 6, 2024 Q-Learning Reinforcement Learning (RL)
Code Code Available 0Behaviour Suite for Reinforcement Learning Aug 9, 2019 reinforcement-learning Reinforcement Learning
Code Code Available 0Quantum enhancements for deep reinforcement learning in large spaces Oct 28, 2019 BIG-bench Machine Learning Decision Making
Code Code Available 0Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM May 16, 2025 Language Modeling Language Modelling
Code Code Available 0Behavior Prior Representation learning for Offline Reinforcement Learning Nov 2, 2022 Offline RL reinforcement-learning
Code Code Available 0Convergent Policy Optimization for Safe Reinforcement Learning Oct 26, 2019 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0A Framework for Automated Cellular Network Tuning with Reinforcement Learning Aug 13, 2018 Management Q-Learning
Code Code Available 0Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification Aug 30, 2023 Reinforcement Learning (RL)
Code Code Available 0A nearly Blackwell-optimal policy gradient method May 28, 2021 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Arena: a toolkit for Multi-Agent Reinforcement Learning Jul 20, 2019 Multi-agent Reinforcement Learning OpenAI Gym
Code Code Available 0Control with adaptive Q-learning Nov 3, 2020 OpenAI Gym Q-Learning
Code Code Available 0