Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning Feb 26, 2025 In-Context Reinforcement Learning Reinforcement Learning (RL)
Distilling Reinforcement Learning Tricks for Video Games Jul 1, 2021 Q-Learning reinforcement-learning
A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems Feb 9, 2020 Combinatorial Optimization Decoder
Distributed Heuristic Multi-Agent Path Finding with Communication Jun 21, 2021 Multi-Agent Path Finding Q-Learning
CompoSuite: A Compositional Reinforcement Learning Benchmark Jul 8, 2022 reinforcement-learning Reinforcement Learning
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow Mar 26, 2021 Model-based Reinforcement Learning reinforcement-learning
Diversify Question Generation with Retrieval-Augmented Style Transfer Oct 23, 2023 Diversity Question Answering
Diversity is All You Need: Learning Skills without a Reward Function Feb 16, 2018 All Diversity
Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions Jun 9, 2025 Reinforcement Learning (RL)
DNA: Proximal Policy Optimization with a Dual Network Architecture Jun 20, 2022 Atari Games Reinforcement Learning (RL)
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning Jun 7, 2021 Multi-agent Reinforcement Learning Offline RL
Does Zero-Shot Reinforcement Learning Exist? Sep 29, 2022 Contrastive Learning reinforcement-learning
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs May 19, 2025 Reinforcement Learning (RL)
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning Jan 31, 2022 Diversity Offline RL
Compile Scene Graphs with Reinforcement Learning Apr 18, 2025 reinforcement-learning Reinforcement Learning
DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems May 27, 2024 Reinforcement Learning (RL)
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient Oct 11, 2024 Mamba Model-based Reinforcement Learning
Dream and Search to Control: Latent Space Planning for Continuous Control Oct 19, 2020 continuous-control Continuous Control
DreamShard: Generalizable Embedding Table Placement for Recommender Systems Oct 5, 2022 GPU Recommendation Systems
Dream to Control: Learning Behaviors by Latent Imagination Dec 3, 2019 Continuous Control reinforcement-learning
Compositional Reinforcement Learning from Logical Specifications Jun 25, 2021 reinforcement-learning Reinforcement Learning
DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning May 26, 2022 Deep Reinforcement Learning reinforcement-learning
Computational Performance of Deep Reinforcement Learning to find Nash Equilibria Apr 26, 2021 Deep Reinforcement Learning reinforcement-learning
CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving Feb 17, 2022 Deep Reinforcement Learning reinforcement-learning
B-Pref: Benchmarking Preference-Based Reinforcement Learning Nov 4, 2021 Benchmarking reinforcement-learning
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime May 28, 2024 Benchmarking Reinforcement Learning (RL)
Bridging RL Theory and Practice with the Effective Horizon Apr 19, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulations Sep 17, 2020 Evolutionary Algorithms Reinforcement Learning (RL)
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control Sep 9, 2020 continuous-control Continuous Control
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL Jun 20, 2022 Question Answering Question Generation
A Crash Course on Reinforcement Learning Mar 8, 2021 reinforcement-learning Reinforcement Learning
Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining Apr 10, 2025 Mathematical Reasoning Reinforcement Learning (RL)
An Experimental Design Perspective on Model-Based Reinforcement Learning Dec 9, 2021 continuous-control Continuous Control
Reinforcement Learning in High-frequency Market Making Jul 14, 2024 Q-Learning reinforcement-learning
Effective Diversity in Population Based Reinforcement Learning Feb 3, 2020 Diversity Point Processes
Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning Aug 30, 2022 Cloud Computing Deep Reinforcement Learning
Efficient Active Search for Combinatorial Optimization Problems Jun 9, 2021 BIG-bench Machine Learning Combinatorial Optimization
Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning Oct 12, 2022 Deep Reinforcement Learning reinforcement-learning
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning Oct 23, 2020 Deep Reinforcement Learning Model-based Reinforcement Learning
Compiler Optimization for Quantum Computing Using Reinforcement Learning Dec 8, 2022 Compiler Optimization reinforcement-learning
Efficient Pressure: Improving efficiency for signalized intersections Dec 4, 2021 Reinforcement Learning (RL) Traffic Signal Control
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate May 24, 2024 Decision Making Reinforcement Learning (RL)
Concise Reasoning via Reinforcement Learning Apr 7, 2025 reinforcement-learning Reinforcement Learning
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification Dec 1, 2021 Decision Making Diagnostic
Constrained episodic reinforcement learning in concave-convex and knapsack settings Jun 9, 2020 reinforcement-learning Reinforcement Learning
Efficient Wasserstein Natural Gradients for Reinforcement Learning Oct 12, 2020 Policy Gradient Methods reinforcement-learning
Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL Oct 12, 2022 Contrastive Learning Out-of-Distribution Generalization
DataLight: Offline Data-Driven Traffic Signal Control Mar 20, 2023 Offline RL Reinforcement Learning (RL)
Evolutionary Planning in Latent Space Nov 23, 2020 reinforcement-learning Reinforcement Learning
Improved Representation of Asymmetrical Distances with Interval Quasimetric Embeddings Nov 28, 2022 reinforcement-learning Reinforcement Learning (RL)