SOTAVerified

Deep Reinforcement Learning

Papers

Showing 151200 of 5822 papers

TitleStatusHype
Contrastive Variational Reinforcement Learning for Complex ObservationsCode1
CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement LearningCode1
Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement LearningCode1
Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement LearningCode1
Continuous Coordination As a Realistic Scenario for Lifelong LearningCode1
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource AllocationCode1
Continuous Deep Q-Learning with Model-based AccelerationCode1
Computational Performance of Deep Reinforcement Learning to find Nash EquilibriaCode1
An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet SpaceCode1
Comparing Observation and Action Representations for Deep Reinforcement Learning in μRTSCode1
Continuous-Time Fitted Value Iteration for Robust PoliciesCode1
An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agentsCode1
Combining Deep Reinforcement Learning and Search for Imperfect-Information GamesCode1
Learning Multi-Pursuit Evasion for Safe Targeted Navigation of DronesCode1
Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary StrategiesCode1
A multi-agent reinforcement learning model of common-pool resource appropriationCode1
Combining Reinforcement Learning and Constraint Programming for Combinatorial OptimizationCode1
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative TasksCode1
Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving ManeuversCode1
An Application of Deep Reinforcement Learning to Algorithmic TradingCode1
An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy SearchCode1
Connecting Deep-Reinforcement-Learning-based Obstacle Avoidance with Conventional Global Planners using Waypoint GeneratorsCode1
Contention Window Optimization in IEEE 802.11ax Networks with Deep Reinforcement LearningCode1
Continuous control with deep reinforcement learningCode1
An Introduction to Deep Reinforcement LearningCode1
An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC controlCode1
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience ReplayCode1
Control-Informed Reinforcement Learning for Chemical ProcessesCode1
Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement LearningCode1
Correlation-aware Cooperative Multigroup Broadcast 360° Video Delivery Network: A Hierarchical Deep Reinforcement Learning ApproachCode1
An Optimistic Perspective on Offline Deep Reinforcement LearningCode1
CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learningCode1
A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer towards Autonomous DrivingCode1
Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement LearningCode1
CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement LearningCode1
Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning ApproachCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human PlayerCode1
A Reinforcement Learning Environment For Job-Shop SchedulingCode1
A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading RulesCode1
ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental PlatformsCode1
DeepACO: Neural-enhanced Ant Systems for Combinatorial OptimizationCode1
Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile NetworksCode1
Deep Deterministic Portfolio OptimizationCode1
A Closer Look at Invalid Action Masking in Policy Gradient AlgorithmsCode1
2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and FollowingCode1
Acme: A Research Framework for Distributed Reinforcement LearningCode1
Deep Intrinsically Motivated Exploration in Continuous ControlCode1
Actor Prioritized Experience ReplayCode1
Amortizing intractable inference in diffusion models for vision, language, and controlCode1
Show:102550
← PrevPage 4 of 117Next →

No leaderboard results yet.