A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum Feb 6, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A Fast Convergence Theory for Offline Decision Making Jun 3, 2024 Decision Making Offline RL
— Unverified 0ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search Nov 6, 2018 continuous-control Continuous Control
— Unverified 0AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning Aug 3, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Deep Curiosity Loops in Social Environments Jun 10, 2018 Hand Detection Optical Flow Estimation
— Unverified 0A Theory of Abstraction in Reinforcement Learning Mar 1, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0A Theoretical Connection Between Statistical Physics and Reinforcement Learning Jun 24, 2019 Decision Making reinforcement-learning
— Unverified 0A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression Jun 11, 2019 Decoder Image Compression
— Unverified 0A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes May 15, 2023 2k Reinforcement Learning (RL)
— Unverified 0A Human Mixed Strategy Approach to Deep Reinforcement Learning Apr 5, 2018 Atari Games Deep Reinforcement Learning
— Unverified 0Adaptive Actor-Critic Based Optimal Regulation for Drift-Free Uncertain Nonlinear Systems Jun 13, 2024 Reinforcement Learning (RL)
— Unverified 0A Tensor Network Approach to Finite Markov Decision Processes Feb 12, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0A Temporal-Pattern Backdoor Attack to Deep Reinforcement Learning May 5, 2022 Backdoor Attack Cloud Computing
— Unverified 0A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors Feb 25, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0A Temporal Difference Reinforcement Learning Theory of Emotion: unifying emotion, cognition and adaptive behavior Jul 24, 2018 Learning Theory Reinforcement Learning
— Unverified 0A Human-Centered Data-Driven Planner-Actor-Critic Architecture via Logic Programming Sep 18, 2019 General Knowledge Reinforcement Learning
— Unverified 0Adaptive action supervision in reinforcement learning from real-world multi-agent demonstrations May 22, 2023 Dynamic Time Warping reinforcement-learning
— Unverified 0ACDER: Augmented Curiosity-Driven Experience Replay Nov 16, 2020 FetchPush-v1 Reinforcement Learning (RL)
— Unverified 0DeepCQ+: Robust and Scalable Routing with Multi-Agent Deep Reinforcement Learning for Highly Dynamic Networks Nov 29, 2021 Deep Reinforcement Learning Q-Learning
— Unverified 0Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability Mar 17, 2017 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0DeepEdge: A Deep Reinforcement Learning based Task Orchestrator for Edge Computing Oct 5, 2021 Deep Reinforcement Learning Edge-computing
— Unverified 0A Technique to Create Weaker Abstract Board Game Agents via Reinforcement Learning Sep 1, 2022 Board Games Q-Learning
— Unverified 0A Technical Study into Small Reasoning Language Models Jun 16, 2025 Code Generation Computational Efficiency
— Unverified 0A Homogenization Approach for Gradient-Dominated Stochastic Optimization Aug 21, 2023 Management Reinforcement Learning (RL)
— Unverified 0A Teacher-Student Framework for Maintainable Dialog Manager Oct 1, 2018 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0A Taxonomy of Similarity Metrics for Markov Decision Processes Mar 8, 2021 Reinforcement Learning (RL) Transfer Learning
— Unverified 0Adaptive ABAC Policy Learning: A Reinforcement Learning Approach May 18, 2021 Attribute Management
— Unverified 0DeepCAS: A Deep Reinforcement Learning Algorithm for Control-Aware Scheduling Mar 8, 2018 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games Aug 28, 2024 Atari Games Benchmarking
— Unverified 0Atari games and Intel processors May 19, 2017 Atari Games BIG-bench Machine Learning
— Unverified 0Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning Apr 30, 2025 Deep Reinforcement Learning Mixed Reality
— Unverified 0Gamifying the Vehicle Routing Problem with Stochastic Requests Nov 14, 2019 Atari Games Decision Making
— Unverified 0A Tale of Two-Timescale Reinforcement Learning with the Tightest Finite-Time Bound Nov 20, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0A Hierarchical Two-tier Approach to Hyper-parameter Optimization in Reinforcement Learning Sep 18, 2019 Bayesian Optimization reinforcement-learning
— Unverified 0A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning Jun 11, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0Deep Coherent Exploration For Continuous Control Jan 1, 2021 continuous-control Continuous Control
— Unverified 0A Hierarchical Reinforcement Learning Method for Persistent Time-Sensitive Tasks Jun 20, 2016 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0A Systematic Decade Review of Trip Route Planning with Travel Time Estimation based on User Preferences and Behavior Mar 30, 2025 Data Integration Federated Learning
— Unverified 0Adapting World Models with Latent-State Dynamics Residuals Apr 3, 2025 MuJoCo Reinforcement Learning (RL)
— Unverified 0Asynchronous training of quantum reinforcement learning Jan 12, 2023 Decision Making Quantum Machine Learning
— Unverified 0A Hierarchical Model for Device Placement Jan 1, 2018 Deep Reinforcement Learning Machine Translation
— Unverified 0Deep Binary Reinforcement Learning for Scalable Verification Mar 11, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Fully Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks Mar 1, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0A Hierarchical Hybrid Learning Framework for Multi-agent Trajectory Prediction Mar 22, 2023 Autonomous Vehicles Motion Planning
— Unverified 0A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning Mar 13, 2017 Cloud Computing Decision Making
— Unverified 0Adapting User Interfaces with Model-based Reinforcement Learning Mar 11, 2021 model Model-based Reinforcement Learning
— Unverified 0Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning Dec 31, 2022 Deep Reinforcement Learning Edge-computing
— Unverified 0Deep Communicating Agents for Abstractive Summarization Mar 27, 2018 Abstractive Text Summarization Decoder
— Unverified 0Asynchronous Fractional Multi-Agent Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing Sep 25, 2024 Deep Reinforcement Learning Edge-computing
— Unverified 0Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis Apr 9, 2024 MuJoCo Reinforcement Learning (RL)
— Unverified 0