Accelerating Goal-Conditioned RL Algorithms and Research Aug 20, 2024 GPU reinforcement-learning
Code Code Available 3Reinforcement Learning Meets Visual Odometry Jul 22, 2024 Decision Making reinforcement-learning
Code Code Available 3Simplifying Deep Temporal Difference Learning Jul 5, 2024 Q-Learning Reinforcement Learning (RL)
Code Code Available 3Is Value Learning Really the Main Bottleneck in Offline RL? Jun 13, 2024 Imitation Learning Offline RL
Code Code Available 3CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving May 15, 2024 Autonomous Driving Autonomous Vehicles
Code Code Available 3ACEGEN: Reinforcement learning of generative chemical agents for drug discovery May 7, 2024 Benchmarking Decision Making
Code Code Available 3ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Feb 29, 2024 Language Modeling Language Modelling
Code Code Available 3Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning Feb 26, 2024 GPU Minecraft
Code Code Available 3Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning Feb 5, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 3Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms Jan 22, 2024 Evolutionary Algorithms reinforcement-learning
Code Code Available 3Generating Synergistic Formulaic Alpha Collections via Reinforcement Learning May 25, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 3OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research May 16, 2023 Philosophy reinforcement-learning
Code Code Available 3Learning Bipedal Walking for Humanoids with Current Feedback Mar 7, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 3EvoTorch: Scalable Evolutionary Computation in Python Feb 24, 2023 GPU reinforcement-learning
Code Code Available 3Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning Jan 26, 2023 Benchmarking Deep Reinforcement Learning
Code Code Available 3imitation: Clean Imitation Learning Implementations Nov 22, 2022 Imitation Learning reinforcement-learning
Code Code Available 3Adversarial Cheap Talk Nov 20, 2022 Meta-Learning Reinforcement Learning (RL)
Code Code Available 3Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning Oct 11, 2022 reinforcement-learning Reinforcement Learning
Code Code Available 3MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library Oct 11, 2022 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 3Discovered Policy Optimisation Oct 11, 2022 Ingenuity Meta-Learning
Code Code Available 3Learning Bipedal Walking On Planned Footsteps For Humanoid Robots Jul 26, 2022 Deep Reinforcement Learning MuJoCo
Code Code Available 3Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos Jun 23, 2022 Imitation Learning Minecraft
Code Code Available 3CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms Nov 16, 2021 Benchmarking Deep Reinforcement Learning
Code Code Available 3On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning Nov 10, 2021 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 3Tianshou: a Highly Modularized Deep Reinforcement Learning Library Jul 29, 2021 Deep Reinforcement Learning MuJoCo
Code Code Available 3FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance Nov 19, 2020 Deep Reinforcement Learning reinforcement-learning
Code Code Available 3Fine-Tuning Language Models from Human Preferences Sep 18, 2019 Descriptive Language Modelling
Code Code Available 3OpenSpiel: A Framework for Reinforcement Learning in Games Aug 26, 2019 General Reinforcement Learning reinforcement-learning
Code Code Available 3Dopamine: A Research Framework for Deep Reinforcement Learning Dec 14, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 3Practical Deep Reinforcement Learning Approach for Stock Trading Nov 19, 2018 Deep Reinforcement Learning reinforcement-learning
Code Code Available 3Deep Reinforcement Learning Oct 15, 2018 Deep Reinforcement Learning Management
Code Code Available 3Distributed Prioritized Experience Replay Mar 2, 2018 Atari Games Deep Reinforcement Learning
Code Code Available 3Rainbow: Combining Improvements in Deep Reinforcement Learning Oct 6, 2017 Atari Games Deep Reinforcement Learning
Code Code Available 3GTA1: GUI Test-time Scaling Agent Jul 8, 2025 Reinforcement Learning (RL) Task Planning
Code Code Available 2High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Jul 8, 2025 MME Reinforcement Learning (RL)
Code Code Available 2AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Jul 8, 2025 GPU reinforcement-learning
Code Code Available 2Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning Jun 27, 2025 Foreground Segmentation object-detection
Code Code Available 2HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context Jun 26, 2025 Large Language Model Multimodal Reasoning
Code Code Available 2OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling Jun 25, 2025 Language Modeling Language Modelling
Code Code Available 2Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning Jun 23, 2025 GPU Large Language Model
Code Code Available 2Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities Jun 22, 2025 Reinforcement Learning (RL)
Code Code Available 2TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning Jun 16, 2025 Reinforcement Learning (RL) Time Series
Code Code Available 2Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models Jun 15, 2025 Reinforcement Learning (RL)
Code Code Available 2TreeRL: LLM Reinforcement Learning with On-Policy Tree Search Jun 13, 2025 Math reinforcement-learning
Code Code Available 2Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning Jun 10, 2025 Model Selection Reinforcement Learning (RL)
Code Code Available 2Play to Generalize: Learning to Reason Through Game Play Jun 9, 2025 Domain Generalization Math
Code Code Available 2Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction Jun 9, 2025 Reinforcement Learning (RL)
Code Code Available 2Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions Jun 9, 2025 Large Language Model Reinforcement Learning (RL)
Code Code Available 2Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning Jun 2, 2025 Fact Verification Language Modeling
Code Code Available 2ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL May 30, 2025 Image Generation Language Modeling
Code Code Available 2