SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1315113200 of 15113 papers

TitleStatusHype
Model Learning for Look-ahead Exploration in Continuous ControlCode0
Reinforcement Learning of Active Vision for Manipulating Objects under OcclusionsCode0
Energy Efficiency in Reinforcement Learning for Wireless Sensor Networks0
Reinforcement Learning and Inverse Reinforcement Learning with System 1 and System 20
Simulated Autonomous Driving in a Realistic Driving Environment using Deep Reinforcement Learning and a Deterministic Finite State Machine0
Measurement-based adaptation protocol with quantum reinforcement learning in a Rigetti quantum computer0
Practical Deep Reinforcement Learning Approach for Stock TradingCode3
Scalable agent alignment via reward modeling: a research directionCode0
Reinforcement Learning with A* and a Deep HeuristicCode0
Learning Actionable Representations with Goal-Conditioned PoliciesCode0
Policy Optimization with Model-based Explorations0
Self-Organizing Maps for Storage and Transfer of Knowledge in Reinforcement Learning0
Recursive Sparse Pseudo-input Gaussian Process SARSA0
Parameter Sharing Reinforcement Learning Architecture for Multi Agent Driving Behaviors0
Emergence of linguistic conventions in multi-agent reinforcement learning0
Improving Automatic Source Code Summarization via Deep Reinforcement LearningCode0
Autonomous Extraction of a Hierarchical Structure of Tasks in Reinforcement Learning, A Sequential Associate Rule Mining Approach0
Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks0
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation0
Orthogonal Policy Gradient and Autonomous Driving Application0
Reward learning from human preferences and demonstrations in AtariCode0
Tight Bayesian Ambiguity Sets for Robust MDPs0
The Utility of Sparse Representations for Control in Reinforcement Learning0
Natural Environment Benchmarks for Reinforcement LearningCode0
Large-scale Interactive Recommendation with Tree-structured Policy Gradient0
Bayesian Reinforcement Learning in Factored POMDPs0
Emergence of Addictive Behaviors in Reinforcement Learning Agents0
Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG0
Image Captioning Based on a Hierarchical Attention Mechanism and Policy Gradient Optimization0
Deep Q learning for fooling neural networksCode0
Coordinating Disaster Emergency Response with Heuristic Reinforcement Learning0
Learning Temporal Point Processes via Reinforcement Learning0
Navigating Assistance System for Quadcopter with Deep Reinforcement Learning0
Importance Weighted Evolution Strategies0
Learning data augmentation policies using augmented random searchCode0
An initial attempt of combining visual selective attention with deep reinforcement learning0
An Optimal Control View of Adversarial Machine Learning0
Optimizing Taxi Carpool Policies via Reinforcement Learning and Spatio-Temporal Mining0
Towards Governing Agent's Efficacy: Action-Conditional β-VAE for Deep Transparent Reinforcement Learning0
Product Title Refinement via Multi-Modal Generative Adversarial Learning0
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition0
Diversity-Driven Extensible Hierarchical Reinforcement LearningCode0
Fully Convolutional Network with Multi-Step Reinforcement Learning for Image ProcessingCode0
Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning0
Reinforcement Learning for Automatic Test Case Prioritization and Selection in Continuous IntegrationCode0
A Hierarchical Framework for Relation Extraction with Reinforcement LearningCode0
Correlation Filter Selection for Visual Tracking Using Reinforcement Learning0
Modular Architecture for StarCraft II with Deep Reinforcement Learning0
Memory-based Deep Reinforcement Learning for Obstacle Avoidance in UAV with Limited Environment KnowledgeCode0
Meta-Learning for Multi-objective Reinforcement Learning0
Show:102550
← PrevPage 264 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified