SOTAVerified

Deep Reinforcement Learning

Papers

Showing 351400 of 5822 papers

TitleStatusHype
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided LearningCode1
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment RepresentationsCode1
Learning Pneumatic Non-Prehensile Manipulation with a Mobile BlowerCode1
Quantum Multi-Agent Reinforcement Learning via Variational Quantum Circuit DesignCode1
Multi-Objective reward generalization: Improving performance of Deep Reinforcement Learning for applications in single-asset tradingCode1
Deep Reinforcement Learning for Entity AlignmentCode1
Model-free Neural Lyapunov Control for Safe Robot NavigationCode1
Affordance Learning from Play for Sample-Efficient Policy LearningCode1
Building a 3-Player Mahjong AI using Deep Reinforcement LearningCode1
Blockchain Framework for Artificial Intelligence ComputationCode1
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in IntralogisticsCode1
A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric VehiclesCode1
CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban DrivingCode1
Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight ControlCode1
Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Persevering EdgeIoTCode1
Optimizing Sequential Experimental Design with Deep Reinforcement LearningCode1
Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary StrategiesCode1
CoTV: Cooperative Control for Traffic Light Signals and Connected Autonomous Vehicles using Deep Reinforcement LearningCode1
Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic EnvironmentsCode1
Mask-based Latent Reconstruction for Reinforcement LearningCode1
The First AI4TSP Competition: Learning to Solve Stochastic Routing ProblemsCode1
Solving Dynamic Graph Problems with Multi-Attention Deep Reinforcement LearningCode1
Verified Probabilistic Policies for Deep Reinforcement LearningCode1
Mirror Learning: A Unifying Framework of Policy OptimisationCode1
Balsa: Learning a Query Optimizer Without Expert DemonstrationsCode1
Sample Efficient Deep Reinforcement Learning via Uncertainty EstimationCode1
Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanismCode1
SimSR: Simple Distance-based State Representation for Deep Reinforcement LearningCode1
Lane Change Decision-Making through Deep Reinforcement LearningCode1
Safety and Liveness Guarantees through Reach-Avoid Reinforcement LearningCode1
A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with DroneCode1
Adversarial Deep Reinforcement Learning for Improving the Robustness of Multi-agent Autonomous Driving PoliciesCode1
Space Non-cooperative Object Active Tracking with Deep Reinforcement LearningCode1
ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental PlatformsCode1
Stochastic Actor-Executor-Critic for Image-to-Image TranslationCode1
Faster Deep Reinforcement Learning with Slower Online NetworkCode1
Federated Deep Reinforcement Learning for the Distributed Control of NextG Wireless NetworksCode1
Functional Regularization for Reinforcement Learning via Learned Fourier FeaturesCode1
EDGE: Explaining Deep Reinforcement Learning PoliciesCode1
Symbolic Regression via Deep Reinforcement Learning Enhanced Genetic Programming SeedingCode1
Automatic Data Augmentation for Generalization in Reinforcement LearningCode1
NovelD: A Simple yet Effective Exploration CriterionCode1
User Allocation in Mobile Edge Computing: A Deep Reinforcement Learning ApproachCode1
Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field ExperimentsCode1
Robust Deep Reinforcement Learning for Quadcopter ControlCode1
Learning Large Neighborhood Search Policy for Integer ProgrammingCode1
URLB: Unsupervised Reinforcement Learning BenchmarkCode1
Learning Domain Invariant Representations in Goal-conditioned Block MDPsCode1
Towards Robust Bisimulation Metric LearningCode1
Learning Collaborative Policies to Solve NP-hard Routing ProblemsCode1
Show:102550
← PrevPage 8 of 117Next →

No leaderboard results yet.