Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning Apr 14, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Adaptive Sampling Quasi-Newton Methods for Zeroth-Order Stochastic Optimization Sep 24, 2021 Reinforcement Learning (RL) Stochastic Optimization
— Unverified 0A comparison of controller architectures and learning mechanisms for arbitrary robot morphologies Sep 25, 2023 Reinforcement Learning (RL)
— Unverified 0Continual and Multi-task Reinforcement Learning With Shared Episodic Memory May 7, 2019 Continual Learning reinforcement-learning
— Unverified 0Bandit approach to conflict-free multi-agent Q-learning in view of photonic implementation Dec 20, 2022 Decision Making Multi-agent Reinforcement Learning
— Unverified 0BANANAS: Bayesian Optimization with Neural Networks for Neural Architecture Search Sep 25, 2019 Bayesian Optimization Neural Architecture Search
— Unverified 0Adaptive Sampling Quasi-Newton Methods for Derivative-Free Stochastic Optimization Oct 29, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping Sep 9, 2024 Reinforcement Learning (RL)
— Unverified 0A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning Dec 2, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A Bibliometric Analysis and Review on Reinforcement Learning for Transportation Applications Oct 26, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Contingency-constrained economic dispatch with safe reinforcement learning May 12, 2022 Computational Efficiency reinforcement-learning
— Unverified 0Balancing Two-Player Stochastic Games with Soft Q-Learning Feb 9, 2018 Q-Learning Reinforcement Learning
— Unverified 0Adaptive Safe Reinforcement Learning-Enabled Optimization of Battery Fast-Charging Protocols Jun 18, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control Aug 10, 2023 Deep Reinforcement Learning Q-Learning
— Unverified 0Balancing SoC in Battery Cells using Safe Action Perturbations Mar 11, 2025 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Balancing Reinforcement Learning Training Experiences in Interactive Information Retrieval Jun 5, 2020 Information Retrieval reinforcement-learning
— Unverified 0A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization Nov 16, 2021 Computational Efficiency Document Summarization
— Unverified 0Multiagent Model-based Credit Assignment for Continuous Control Dec 27, 2021 continuous-control Continuous Control
— Unverified 0Continual Adversarial Reinforcement Learning (CARL) of False Data Injection detection: forgetting and explainability Nov 15, 2024 Continual Learning Reinforcement Learning (RL)
— Unverified 0Continual Auxiliary Task Learning Feb 22, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Balancing Progress and Safety: A Novel Risk-Aware Objective for RL in Autonomous Driving May 10, 2025 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0Balancing Profit, Risk, and Sustainability for Portfolio Management Jun 6, 2022 Management Portfolio Optimization
— Unverified 0A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications Oct 28, 2024 Multi-agent Reinforcement Learning OpenAI Gym
— Unverified 0Balancing Profit and Fairness in Risk-Based Pricing Markets May 30, 2025 Fairness Reinforcement Learning (RL)
— Unverified 0Adaptive routing protocols for determining optimal paths in AI multi-agent systems: a priority- and learning-enhanced approach Mar 10, 2025 Reinforcement Learning (RL)
— Unverified 0A Comparison of Action Spaces for Learning Manipulation Tasks Aug 23, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Balancing Constraints and Rewards with Meta-Gradient D4PG Oct 13, 2020 MuJoCo Reinforcement Learning (RL)
— Unverified 0Balancing Act: Prioritization Strategies for LLM-Designed Restless Bandit Rewards Aug 22, 2024 Language Modeling Language Modelling
— Unverified 0A Multi-Agent Reinforcement Learning Method for Impression Allocation in Online Display Advertising Sep 10, 2018 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Balancing Accuracy and Fairness for Interactive Recommendation with Reinforcement Learning Jun 25, 2021 Fairness Interactive Recommendation
— Unverified 0Balancing a CartPole System with Reinforcement Learning -- A Tutorial Jun 8, 2020 OpenAI Gym Q-Learning
— Unverified 0A Multi-agent Reinforcement Learning Approach for Efficient Client Selection in Federated Learning Jan 9, 2022 Federated Learning Multi-agent Reinforcement Learning
— Unverified 0Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL Jun 6, 2022 Decision Making Deep Reinforcement Learning
— Unverified 0Contextual Transformer for Offline Meta Reinforcement Learning Nov 15, 2022 D4RL Meta Reinforcement Learning
— Unverified 0A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics Jan 15, 2014 reinforcement-learning Reinforcement Learning
— Unverified 0Contextual Latent-Movements Off-Policy Optimization for Robotic Manipulation Skills Oct 26, 2020 Imitation Learning Reinforcement Learning (RL)
— Unverified 0A Comparative Study of Reinforcement Learning Techniques on Dialogue Management Apr 1, 2012 Dialogue Management Management
— Unverified 0Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
— Unverified 0No-regret Exploration in Contextual Reinforcement Learning Mar 14, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions using Reinforcement Learning Mar 22, 2023 Autonomous Vehicles Management
— Unverified 0Bad-Policy Density: A Measure of Reinforcement Learning Hardness Oct 7, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0AbFlowNet: Optimizing Antibody-Antigen Binding Energy via Diffusion-GFlowNet Fusion May 18, 2025 Reinforcement Learning (RL)
— Unverified 0BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT Feb 21, 2023 Backdoor Attack Language Modeling
— Unverified 0BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs Feb 17, 2022 Reinforcement Learning (RL) State Estimation
— Unverified 0A Multi-Agent Deep Reinforcement Learning Coordination Framework for Connected and Automated Vehicles at Merging Roadways Sep 23, 2021 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts Feb 29, 2020 Mixture-of-Experts OpenAI Gym
— Unverified 0Contingency-Aware Exploration in Reinforcement Learning Nov 5, 2018 Atari Games Montezuma's Revenge
— Unverified 0A Multi-Agent Deep Reinforcement Learning Approach for a Distributed Energy Marketplace in Smart Grids Sep 23, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts Aug 4, 2022 Generative Adversarial Network Model-based Reinforcement Learning
— Unverified 0Contextual Exploration Using a Linear Approximation Method Based on Satisficing Dec 13, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0