Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods Jan 25, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Direct Mutation and Crossover in Genetic Algorithms Applied to Reinforcement Learning Tasks Jan 13, 2022 OpenAI Gym reinforcement-learning
— Unverified 0Direct optimization of F-measure for retrieval-based personal question answering Sep 28, 2018 Question Answering reinforcement-learning
— Unverified 0Direct Uncertainty Estimation in Reinforcement Learning Jun 6, 2013 reinforcement-learning Reinforcement Learning
— Unverified 0Dirichlet policies for reinforced factor portfolios Nov 10, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Discerning Temporal Difference Learning Oct 12, 2023 Reinforcement Learning (RL)
— Unverified 0DisCoRL: Continual Reinforcement Learning via Policy Distillation Jul 11, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies Apr 23, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Discounted Reinforcement Learning Is Not an Optimization Problem Oct 4, 2019 Misconceptions reinforcement-learning
— Unverified 0Discourse-Aware Neural Rewards for Coherent Text Generation May 10, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Discourse Coherence, Reference Grounding and Goal Oriented Dialogue Jul 8, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning Apr 20, 2021 Clustering Decision Making
— Unverified 0Discovering and Removing Exogenous State Variables and Rewards for Reinforcement Learning Jun 5, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Discovering Blind Spots in Reinforcement Learning May 23, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Discovering Command and Control (C2) Channels on Tor and Public Networks Using Reinforcement Learning Feb 14, 2024 Reinforcement Learning (RL)
— Unverified 0Discovering Command and Control Channels Using Reinforcement Learning Jan 13, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Discovering Exfiltration Paths Using Reinforcement Learning with Attack Graphs Jan 28, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Discovering Generalizable Skills via Automated Generation of Diverse Tasks Jun 26, 2021 Diversity Hierarchical Reinforcement Learning
— Unverified 0Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning Feb 20, 2025 Reinforcement Learning (RL)
— Unverified 0Discovering Latent States for Model Learning: Applying Sensorimotor Contingencies Theory and Predictive Processing to Model Context Aug 1, 2016 model reinforcement-learning
— Unverified 0Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning Jun 10, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Discovering Options for Exploration by Minimizing Cover Time Mar 2, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning Jul 16, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Discover the Hidden Attack Path in Multi-domain Cyberspace Based on Reinforcement Learning Apr 15, 2021 Reinforcement Learning (RL)
— Unverified 0Discovery of False Data Injection Schemes on Frequency Controllers with Reinforcement Learning Aug 30, 2024 Reinforcement Learning (RL)
— Unverified 0Discovery of Optimal Quantum Error Correcting Codes via Reinforcement Learning May 10, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Discovery of Options via Meta-Learned Subgoals Feb 12, 2021 Reinforcement Learning (RL)
— Unverified 0Discovery of Useful Questions as Auxiliary Tasks Sep 10, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Discrete Control in Real-World Driving Environments using Deep Reinforcement Learning Nov 29, 2022 Data Augmentation Deep Reinforcement Learning
— Unverified 0Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning Nov 1, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Smaller World Models for Reinforcement Learning Oct 12, 2020 Atari Games reinforcement-learning
— Unverified 0Discrete linear-complexity reinforcement learning in continuous action spaces for Q-learning algorithms Jul 16, 2018 Q-Learning reinforcement-learning
— Unverified 0Discrete MDL Predicts in Total Variation Dec 1, 2009 reinforcement-learning Reinforcement Learning
— Unverified 0Discrete Predictive Representation for Long-horizon Planning Jan 1, 2021 Deep Reinforcement Learning Object
— Unverified 0Discrete-Time Mean Field Control with Environment States Apr 30, 2021 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning May 12, 2025 Image Generation Reinforcement Learning (RL)
— Unverified 0Discriminator Augmented Model-Based Reinforcement Learning Mar 24, 2021 model Model-based Reinforcement Learning
— Unverified 0Disentangled Predictive Representation for Meta-Reinforcement Learning Jun 13, 2021 Meta Reinforcement Learning reinforcement-learning
— Unverified 0Disentangled Skill Embeddings for Reinforcement Learning Jun 21, 2019 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning Mar 11, 2025 Disentanglement Reinforcement Learning (RL)
— Unverified 0Disentangling causal effects for hierarchical reinforcement learning Oct 3, 2020 counterfactual Descriptive
— Unverified 0Disentangling Controllable and Uncontrollable Factors of Variation by Interacting with the World Apr 19, 2018 Disentanglement reinforcement-learning
— Unverified 0Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning Feb 21, 2020 Atari Games Object
— Unverified 0Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction May 27, 2019 continuous-control Continuous Control
— Unverified 0Disentangling Epistemic and Aleatoric Uncertainty in Reinforcement Learning Jun 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Disentangling Generalization in Reinforcement Learning Sep 29, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Disentangling Options with Hellinger Distance Regularizer Apr 15, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning Sep 19, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Disentangling Sources of Risk for Distributional Multi-Agent Reinforcement Learning Sep 29, 2021 Multi-agent Reinforcement Learning quantile regression
— Unverified 0Disentangling Transfer in Continual Reinforcement Learning Sep 28, 2022 Continual Learning continuous-control
— Unverified 0