A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning Aug 16, 2022 Deep Reinforcement Learning reinforcement-learning
Code Code Available 2Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models May 15, 2025 Math reinforcement-learning
Code Code Available 2Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models Mar 18, 2025 Anatomy Attribute
Code Code Available 2Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning Feb 14, 2025 Reinforcement Learning (RL) Skills Assessment
Code Code Available 2AGILE: A Novel Reinforcement Learning Framework of LLM Agents May 23, 2024 Question Answering reinforcement-learning
Code Code Available 2A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning Aug 5, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 2Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning Oct 24, 2019 Meta-Learning Meta Reinforcement Learning
Code Code Available 2Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization May 25, 2024 continuous-control Continuous Control
Code Code Available 2MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments Nov 30, 2022 Multi-Objective Reinforcement Learning OpenAI Gym
Code Code Available 2MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning Jul 23, 2024 Benchmarking Decision Making
Code Code Available 2Efficient World Models with Context-Aware Tokenization Jun 27, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 2MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning Apr 14, 2025 Machine Translation Reinforcement Learning (RL)
Code Code Available 2Benchmarking Potential Based Rewards for Learning Humanoid Locomotion Jul 19, 2023 Benchmarking Reinforcement Learning (RL)
Code Code Available 2GenRL: Multimodal-foundation world models for generalization in embodied agents Jun 26, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 2Benchmarking Deep Reinforcement Learning for Continuous Control Apr 22, 2016 Action Triplet Recognition Atari Games
Code Code Available 2DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems May 30, 2022 Diversity reinforcement-learning
Code Code Available 2Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management Feb 1, 2024 Deep Reinforcement Learning Management
Code Code Available 2Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot Feb 20, 2023 Efficient Exploration reinforcement-learning
Code Code Available 2OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling Jun 25, 2025 Language Modeling Language Modelling
Code Code Available 2ODRL: A Benchmark for Off-Dynamics Reinforcement Learning Oct 28, 2024 Benchmarking reinforcement-learning
Code Code Available 2Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling Jan 20, 2025 Imitation Learning Language Modeling
Code Code Available 2Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Feb 24, 2025 GSM8K Math
Code Code Available 2Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration May 26, 2025 Domain Generalization Hallucination
Code Code Available 2Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models May 30, 2018 Deep Reinforcement Learning Model-based Reinforcement Learning
Code Code Available 2Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching Dec 16, 2020 Combinatorial Optimization Decision Making
Code Code Available 2Dialogue Learning With Human-In-The-Loop Nov 29, 2016 Question Answering reinforcement-learning
Code Code Available 2Decoupling Representation Learning from Reinforcement Learning Sep 14, 2020 Data Augmentation Deep Reinforcement Learning
Code Code Available 2DayDreamer: World Models for Physical Robot Learning Jun 28, 2022 Deep Reinforcement Learning Navigate
Code Code Available 2Datasets and Benchmarks for Offline Safe Reinforcement Learning Jun 15, 2023 Autonomous Driving Benchmarking
Code Code Available 2D4RL: Datasets for Deep Data-Driven Reinforcement Learning Apr 15, 2020 D4RL Offline RL
Code Code Available 2Craftium: An Extensible Framework for Creating Reinforcement Learning Environments Jul 4, 2024 Benchmarking Minecraft
Code Code Available 2PC-Gym: Benchmark Environments For Process Control Problems Oct 29, 2024 Benchmarking Chemical Process
Code Code Available 2CTR-Driven Advertising Image Generation with Multimodal Large Language Models Feb 5, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 2AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers Nov 17, 2024 In-Context Learning Meta-Learning
Code Code Available 2Curiosity-driven Red-teaming for Large Language Models Feb 29, 2024 Red Teaming Reinforcement Learning (RL)
Code Code Available 2Deep Reinforcement Learning for Multi-Agent Interaction Aug 2, 2022 BIG-bench Machine Learning Causal Inference
Code Code Available 2DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation Oct 19, 2022 Deep Reinforcement Learning Imitation Learning
Code Code Available 2EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data Mar 1, 2024 continuous-control Continuous Control
Code Code Available 2Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning Sep 18, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 2Policy improvement by planning with Gumbel Sep 29, 2021 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 2Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms Nov 30, 2023 Benchmarking OpenAI Gym
Code Code Available 1A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning Oct 10, 2022 Data Augmentation reinforcement-learning
Code Code Available 1Control-Informed Reinforcement Learning for Chemical Processes Aug 24, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1A Composable Specification Language for Reinforcement Learning Tasks Aug 21, 2020 reinforcement-learning Reinforcement Learning
Code Code Available 1A Boolean Task Algebra for Reinforcement Learning Jan 6, 2020 Lifelong learning Negation
Code Code Available 1Contrastive Variational Reinforcement Learning for Complex Observations Aug 6, 2020 Atari Games Continuous Control
Code Code Available 1Controlling the Risk of Conversational Search via Reinforcement Learning Jan 15, 2021 Conversational Search reinforcement-learning
Code Code Available 1Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL Oct 12, 2022 Contrastive Learning Out-of-Distribution Generalization
Code Code Available 1Contrastive Reinforcement Learning of Symbolic Reasoning Domains Jun 16, 2021 reinforcement-learning Reinforcement Learning
Code Code Available 1Contrastive State Augmentations for Reinforcement Learning-Based Recommender Systems May 18, 2023 Recommendation Systems reinforcement-learning
Code Code Available 1