Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts Aug 4, 2022 Generative Adversarial Network Model-based Reinforcement Learning
— Unverified 0Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces Jun 8, 2018 Atari Games Gaussian Processes
— Unverified 0Backward Curriculum Reinforcement Learning Dec 29, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Multiagent CyberBattleSim for RL Cyber Operation Agents Apr 3, 2023 CyberBattleSim Reinforcement Learning (RL)
— Unverified 0Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning Feb 8, 2021 Misconceptions Multi-agent Reinforcement Learning
— Unverified 0AMRL: Aggregated Memory For Reinforcement Learning May 1, 2020 Minecraft reinforcement-learning
— Unverified 0Backstepping Temporal Difference Learning Feb 20, 2023 Reinforcement Learning (RL)
— Unverified 02048: Reinforcement Learning in a Delayed Reward Environment Jul 7, 2025 quantile regression reinforcement-learning
— Unverified 0Back-stepping Experience Replay with Application to Model-free Reinforcement Learning for a Soft Snake Robot Jan 21, 2024 Friction Reinforcement Learning (RL)
— Unverified 0A Comparative Study of Deep Reinforcement Learning for Crop Production Management Nov 6, 2024 Deep Reinforcement Learning Management
— Unverified 0Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty Apr 19, 2024 Q-Learning reinforcement-learning
— Unverified 0A Better Baseline for Second Order Gradient Estimation in Stochastic Computation Graphs Sep 27, 2018 Meta-Learning Multi-agent Reinforcement Learning
— Unverified 0Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity Feb 24, 2020 Language Modeling Language Modelling
— Unverified 0Backpropagation through Time and Space: Learning Numerical Methods with Multi-Agent Reinforcement Learning Mar 16, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Adaptive Reward-Poisoning Attacks against Reinforcement Learning Mar 27, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Continuous-time optimal investment with portfolio constraints: a reinforcement learning approach Dec 14, 2024 Reinforcement Learning (RL)
— Unverified 0A Modular Test Bed for Reinforcement Learning Incorporation into Industrial Applications Jun 2, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Backplay: 'Man muss immer umkehren' May 1, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Scene Induced Multi-Modal Trajectory Forecasting via Planning May 23, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Backdoors in DRL: Four Environments Focusing on In-distribution Triggers May 22, 2025 Backdoor Attack Data Poisoning
— Unverified 0MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management Feb 6, 2021 Decision Making Management
— Unverified 0Continuous-Time Reinforcement Learning: New Design Algorithms with Theoretical Insights and Performance Guarantees Jul 18, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning Feb 5, 2024 Contrastive Learning D4RL
— Unverified 0Control-Aware Representations for Model-based Reinforcement Learning Jun 24, 2020 model Model-based Reinforcement Learning
— Unverified 0BACKDOORL: Backdoor Attack against Competitive Reinforcement Learning May 2, 2021 Atari Games Backdoor Attack
— Unverified 0PolicyCleanse: Backdoor Detection and Mitigation in Reinforcement Learning Feb 8, 2022 Machine Unlearning reinforcement-learning
— Unverified 0Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches Jun 16, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Bach2Bach: Generating Music Using A Deep Reinforcement Learning Approach Dec 3, 2018 Deep Reinforcement Learning reinforcement-learning
— Unverified 0A Modular and Transferable Reinforcement Learning Framework for the Fleet Rebalancing Problem May 27, 2021 Decision Making reinforcement-learning
— Unverified 0Adaptive Reinforcement Learning through Evolving Self-Modifying Neural Networks May 22, 2020 Meta-Learning reinforcement-learning
— Unverified 0A Modified Q-Learning Algorithm for Rate-Profiling of Polarization Adjusted Convolutional (PAC) Codes Oct 4, 2021 Q-Learning reinforcement-learning
— Unverified 0B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning Jan 30, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0A Model Selection Approach for Corruption Robust Reinforcement Learning Oct 7, 2021 Model Selection Multi-Armed Bandits
— Unverified 0Adaptive Reinforcement Learning Model for Simulation of Urban Mobility during Crises Sep 2, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Continuously Learning Neural Dialogue Management Jun 8, 2016 Dialogue Management Management
— Unverified 0AWD3: Dynamic Reduction of the Estimation Bias Nov 12, 2021 continuous-control Continuous Control
— Unverified 0A model of discrete choice based on reinforcement learning under short-term memory Aug 16, 2019 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Avoiding Wireheading with Value Reinforcement Learning May 10, 2016 reinforcement-learning Reinforcement Learning
— Unverified 0A Model-free Learning Algorithm for Infinite-horizon Average-reward MDPs with Near-optimal Regret Jun 8, 2020 Q-Learning reinforcement-learning
— Unverified 0Adaptive Reinforcement Learning for State Avoidance in Discrete Event Systems Feb 28, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Avoiding Negative Side-Effects and Promoting Safe Exploration with Imaginative Planning Sep 25, 2019 Reinforcement Learning (RL) Safe Exploration
— Unverified 0Avoiding mode collapse in diffusion models fine-tuned with reinforcement learning Oct 10, 2024 Denoising Diversity
— Unverified 0A Comparative Study of AI-based Intrusion Detection Techniques in Critical Infrastructures Jul 24, 2020 Intrusion Detection Management
— Unverified 0Avoiding Jammers: A Reinforcement Learning Approach Nov 20, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Avoiding Catastrophic States with Intrinsic Fear Jan 1, 2018 Atari Games Deep Reinforcement Learning
— Unverified 0Adaptive Reinforcement Learning for Unobservable Random Delays Jun 17, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management Nov 29, 2017 Benchmarking Deep Reinforcement Learning
— Unverified 0Continuous Input Embedding Size Search For Recommender Systems Apr 7, 2023 Recommendation Systems Reinforcement Learning (RL)
— Unverified 0Avoidance Learning Using Observational Reinforcement Learning Sep 24, 2019 Imitation Learning reinforcement-learning
— Unverified 0A Visual Communication Map for Multi-Agent Deep Reinforcement Learning Feb 27, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0