SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 27012725 of 15113 papers

TitleStatusHype
Interactive Query-Assisted Summarization via Deep Reinforcement LearningCode0
Interactive Learning from Activity DescriptionCode0
Towards Abstractive Timeline Summarisation using Preference-based Reinforcement LearningCode0
Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement LearningCode0
Asynchronous Methods for Model-Based Reinforcement LearningCode0
Air Learning: A Deep Reinforcement Learning Gym for Autonomous Aerial Robot Visual NavigationCode0
A Hierarchical Framework for Relation Extraction with Reinforcement LearningCode0
Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex EnvironmentsCode0
Adapting to Reward Progressivity via Spectral Reinforcement LearningCode0
Intelligent Traffic Light via Policy-based Deep Reinforcement LearningCode0
Integrating Reinforcement Learning, Action Model Learning, and Numeric Planning for Tackling Complex TasksCode0
Accuracy-based Curriculum Learning in Deep Reinforcement LearningCode0
A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement LearningCode0
Intelligent Trainer for Model-Based Reinforcement LearningCode0
Instance Weighted Incremental Evolution Strategies for Reinforcement Learning in Dynamic EnvironmentsCode0
Instance Selection for Dynamic Algorithm Configuration with Reinforcement Learning: Improving GeneralizationCode0
Policy Iterations for Reinforcement Learning Problems in Continuous Time and Space -- Fundamental Theory and MethodsCode0
Insights From the NeurIPS 2021 NetHack ChallengeCode0
Instance based Generalization in Reinforcement LearningCode0
Inherently Explainable Reinforcement Learning in Natural LanguageCode0
Input Convex Neural NetworksCode0
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement LearningCode0
A2-RL: Aesthetics Aware Reinforcement Learning for Image CroppingCode0
Information-Theoretic State Variable Selection for Reinforcement LearningCode0
IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-TuningCode0
Show:102550
← PrevPage 109 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified