CORAL: Contextual Response Retrievability Loss Function for Training Dialog Generation Models May 21, 2022 Reinforcement Learning (RL) Text Generation
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback May 13, 2023 MuJoCo Reinforcement Learning (RL)
ACTRCE: Augmenting Experience via Teacher’s Advice May 1, 2019 reinforcement-learning Reinforcement Learning
CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation Mar 11, 2024 Recommendation Systems Reinforcement Learning (RL)
Delay-aware Resource Allocation in Fog-assisted IoT Networks Through Reinforcement Learning Apr 30, 2020 reinforcement-learning Reinforcement Learning (RL)
Delay Constrained Buffer-Aided Relay Selection in the Internet of Things with Decision-Assisted Reinforcement Learning Nov 20, 2020 Deep Reinforcement Learning reinforcement-learning
Optimism and Delays in Episodic Reinforcement Learning Nov 15, 2021 reinforcement-learning Reinforcement Learning
Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning Sep 26, 2022 reinforcement-learning Reinforcement Learning
Delayed Reinforcement Learning by Imitation May 11, 2022 Imitation Learning reinforcement-learning
Delays in Reinforcement Learning Sep 20, 2023 Decision Making reinforcement-learning
Delegative Reinforcement Learning: learning to avoid traps with a little help Jul 19, 2019 Model-based Reinforcement Learning reinforcement-learning
A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning Dec 2, 2023 reinforcement-learning Reinforcement Learning (RL)
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding Jun 1, 2023 Management Offline RL
BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping Sep 9, 2024 Reinforcement Learning (RL)
Delving into adversarial attacks on deep policies May 18, 2017 Deep Reinforcement Learning reinforcement-learning
Delving into Macro Placement with Reinforcement Learning Sep 6, 2021 reinforcement-learning Reinforcement Learning
Assessment of Reward Functions in Reinforcement Learning for Multi-Modal Urban Traffic Control under Real-World limitations Oct 17, 2020 reinforcement-learning Reinforcement Learning
Demand response for residential building heating: Effective Monte Carlo Tree Search control based on physics-informed neural networks Dec 6, 2023 Board Games Model Predictive Control
A General Perspective on Objectives of Reinforcement Learning Jun 5, 2023 reinforcement-learning Reinforcement Learning
Demonstration-efficient Inverse Reinforcement Learning in Procedurally Generated Environments Dec 4, 2020 Deep Reinforcement Learning reinforcement-learning
Diverse Exploration for Fast and Safe Policy Improvement Feb 22, 2018 Diversity reinforcement-learning
Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction Jun 27, 2019 Deep Reinforcement Learning reinforcement-learning
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches Feb 13, 2025 D4RL Offline RL
Assessment of Reinforcement Learning Algorithms for Nuclear Power Plant Fuel Optimization May 9, 2023 Combinatorial Optimization Deep Reinforcement Learning
Demonstration-Regularized RL Oct 26, 2023 reinforcement-learning Reinforcement Learning
Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation Jan 29, 2018 reinforcement-learning Reinforcement Learning
CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation Jan 28, 2021 Decision Making Q-Learning
A Generalized Reinforcement Learning Algorithm for Online 3D Bin-Packing Jul 1, 2020 3D Bin Packing Deep Reinforcement Learning
Demystify Painting with RL Dec 14, 2020 Decision Making Reinforcement Learning (RL)
Coordination of PV Smart Inverters Using Deep Reinforcement Learning for Grid Voltage Regulation Oct 14, 2019 Deep Reinforcement Learning reinforcement-learning
Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning Dec 16, 2019 Deep Reinforcement Learning reinforcement-learning
De Novo Molecular Design Enabled by Direct Preference Optimization and Curriculum Learning Apr 2, 2025 Drug Discovery Reinforcement Learning (RL)
Assessing Transferability from Simulation to Reality for Reinforcement Learning Jul 10, 2019 reinforcement-learning Reinforcement Learning
Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations Dec 2, 2024 continuous-control Continuous Control
5G MIMO Data for Machine Learning: Application to Beam-Selection using Deep Learning Jun 9, 2021 BIG-bench Machine Learning Deep Learning
Density-Based Bonuses on Learned Representations for Reward-Free Exploration in Deep Reinforcement Learning Jun 13, 2021 Deep Reinforcement Learning Density Estimation
Density Constrained Reinforcement Learning Jun 24, 2021 reinforcement-learning Reinforcement Learning
Basic protocols in quantum reinforcement learning with superconducting circuits Jan 18, 2017 BIG-bench Machine Learning Quantum Machine Learning
Coordination-driven learning in multi-agent problem spaces Sep 13, 2018 Multi-agent Reinforcement Learning reinforcement-learning
Dependency Parsing with Backtracking using Deep Reinforcement Learning Jun 28, 2022 Deep Reinforcement Learning Dependency Parsing
Depending on yourself when you should: Mentoring LLM with RL agents to become the master in cybersecurity games Mar 26, 2024 Reinforcement Learning (RL)
Batch-Augmented Multi-Agent Reinforcement Learning for Efficient Traffic Signal Optimization May 19, 2020 Multi-agent Reinforcement Learning reinforcement-learning
Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework Jan 24, 2025 Q-Learning Reinforcement Learning (RL)
Deploying Reinforcement Learning in Water Transport Dec 14, 2020 Q-Learning reinforcement-learning
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL Sep 19, 2024 Reinforcement Learning (RL)
Coordinating Policies Among Multiple Agents via an Intelligent Communication Channel May 21, 2022 Intelligent Communication Multi-agent Reinforcement Learning
Depth and nonlinearity induce implicit exploration for RL May 29, 2018 Q-Learning reinforcement-learning
Depth-Constrained ASV Navigation with Deep RL and Limited Sensing Apr 25, 2025 Decision Making Reinforcement Learning (RL)
Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning Nov 12, 2018 Multi-agent Reinforcement Learning reinforcement-learning
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning Apr 28, 2021 reinforcement-learning Reinforcement Learning (RL)