SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from feedback in the form of rewards or penalties for its actions. The goal is to find an optimal policy, that is, a decision-making strategy that maximizes the long-term reward.
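To make the agent-environment loop concrete, here is a minimal sketch using tabular Q-learning, one common RL algorithm chosen purely for illustration. The five-cell corridor environment, the function names (`step`, `train`, `greedy_policy`), and the hyperparameter values are all assumptions made for this example and do not come from any paper listed on this page.

```python
import random

# Hypothetical toy environment (an assumption for illustration):
# a corridor of 5 cells. The agent starts in cell 0; reaching cell 4
# yields reward +1 and ends the episode. All other steps yield 0.
N_STATES = 5
ACTIONS = (-1, +1)  # index 0 = move left, index 1 = move right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                # Explore: pick a random action.
                a = rng.randrange(len(ACTIONS))
            else:
                # Exploit: pick a best-valued action, breaking ties randomly.
                best = max(q[state])
                a = rng.choice([i for i, v in enumerate(q[state]) if v == best])
            next_state, reward, done = step(state, ACTIONS[a])
            # Move Q(s, a) toward the reward plus the discounted best next value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Return the greedy action for each non-terminal cell."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: q[s][i])]
            for s in range(N_STATES - 1)]
```

Under these assumptions, `greedy_policy(train())` should select the move to the right in every non-terminal cell, which is the reward-maximizing strategy for this corridor. Real benchmarks replace the lookup table with a function approximator and the corridor with a far richer environment.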

Papers

Showing 11201-11250 of 15113 papers

| Title | Status | Hype |
|---|---|---|
| End-to-End Vision-Based Adaptive Cruise Control (ACC) Using Deep Reinforcement Learning | | 0 |
| PCGRL: Procedural Content Generation via Reinforcement Learning | Code | 1 |
| Multi-objective Neural Architecture Search via Non-stationary Policy Gradient | | 0 |
| Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework | | 0 |
| Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning | Code | 1 |
| Graph Constrained Reinforcement Learning for Natural Language Action Spaces | Code | 1 |
| Reducing Non-Normative Text Generation from Language Models | | 0 |
| GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling | Code | 0 |
| Reinforcement Learning Based Vehicle-cell Association Algorithm for Highly Mobile Millimeter Wave Communication | | 0 |
| On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning | Code | 1 |
| On Solving Cooperative MARL Problems with a Few Good Experiences | | 0 |
| Local Policy Optimization for Trajectory-Centric Reinforcement Learning | | 0 |
| Emergence of Pragmatics from Referential Game between Theory of Mind Agents | Code | 0 |
| Cooperative Highway Work Zone Merge Control based on Reinforcement Learning in A Connected and Automated Environment | | 0 |
| Improving Interaction Quality Estimation with BiLSTMs and the Impact on Dialogue Policy Learning | | 0 |
| Intelligent Bandwidth Allocation for Latency Management in NG-EPON using Reinforcement Learning Methods | | 0 |
| Unsupervisedly Learned Representations: Should the Quest be Over? | | 0 |
| Lyceum: An efficient and scalable ecosystem for robot learning | | 0 |
| SARL*: Deep Reinforcement Learning based Human-Aware Navigation for Mobile Robot in Indoor Environments | Code | 1 |
| Reinforcement Learning with Probabilistically Complete Exploration | | 0 |
| Nested-Wasserstein Self-Imitation Learning for Sequence Generation | | 0 |
| Memristor Hardware-Friendly Reinforcement Learning | | 0 |
| FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback | | 0 |
| A Survey of Reinforcement Learning Techniques: Strategies, Recent Development, and Future Directions | | 0 |
| Discriminator Soft Actor Critic without Extrinsic Rewards | Code | 1 |
| Gradient Surgery for Multi-Task Learning | Code | 1 |
| Learning Options from Demonstration using Skill Segmentation | | 0 |
| cube2net: Efficient Query-Specific Network Construction with Data Cube Organization | | 0 |
| BNAS:An Efficient Neural Architecture Search Approach Using Broad Scalable Architecture | | 0 |
| Multi-agent Motion Planning for Dense and Dynamic Environments via Deep Reinforcement Learning | | 0 |
| Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video | Code | 1 |
| Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory | | 0 |
| Reward Shaping for Reinforcement Learning with Omega-Regular Objectives | | 0 |
| MIME: Mutual Information Minimisation Exploration | | 0 |
| Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping | | 0 |
| SEERL: Sample Efficient Ensemble Reinforcement Learning | | 0 |
| Robotic Grasp Manipulation Using Evolutionary Computing and Deep Reinforcement Learning | | 0 |
| Lipschitz Lifelong Reinforcement Learning | Code | 1 |
| Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO | Code | 0 |
| PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning | Code | 1 |
| Multi-Robot Formation Control Using Reinforcement Learning | | 0 |
| Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Code | 0 |
| POPCORN: Partially Observed Prediction COnstrained ReiNforcement Learning | Code | 1 |
| GridMask Data Augmentation | Code | 1 |
| Exploiting Language Instructions for Interpretable and Compositional Reinforcement Learning | | 0 |
| Learning to Locomote with Deep Neural-Network and CPG-based Control in a Soft Snake Robot | | 0 |
| Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning | | 0 |
| Deep Reinforcement Learning for Complex Manipulation Tasks with Sparse Feedback | | 0 |
| Sparse Black-box Video Attack with Reinforcement Learning | Code | 0 |
| Reward Engineering for Object Pick and Place Training | Code | 0 |
Page 225 of 303

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PPG | Mean Normalized Performance | 0.76 | | Unverified |
| 2 | PPO | Mean Normalized Performance | 0.58 | | Unverified |