SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 53515400 of 15113 papers

TitleStatusHype
Improving width-based planning with compact policies0
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions0
IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues0
IMRL: Integrating Visual, Physical, Temporal, and Geometric Representations for Enhanced Food Acquisition0
I'm sorry Dave, I'm afraid I can't do that, Deep Q-learning from forbidden action0
Inapplicable Actions Learning for Knowledge Transfer in Reinforcement Learning0
Incentive-based demand response for smart grid with reinforcement learning and deep neural network0
Incentivizing an Unknown Crowd0
Generalizing Emergent Communication0
In-context Exploration-Exploitation for Reinforcement Learning0
Large Language Models can Implement Policy Iteration0
Incorporating Consistency Verification into Neural Data-to-Document Generation0
Incorporating Deception into CyberBattleSim for Autonomous Defense0
Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning0
Incorporating Graph Attention Mechanism into Knowledge Graph Reasoning Based on Deep Reinforcement Learning0
Incorporating Human Domain Knowledge into Large Scale Cost Function Learning0
Incorporating Pragmatic Reasoning Communication into Emergent Language0
Incorporating Relational Background Knowledge into Reinforcement Learning via Differentiable Inductive Logic Programming0
Incorporating Rivalry in Reinforcement Learning for a Competitive Game0
Incorporating Stylistic Lexical Preferences in Generative Language Models0
Incorporating Voice Instructions in Model-Based Reinforcement Learning for Self-Driving Cars0
Incorporation of Deep Neural Network & Reinforcement Learning with Domain Knowledge0
Increasing Energy Efficiency of Massive-MIMO Network via Base Stations Switching using Reinforcement Learning and Radio Environment Maps0
Increasing the Efficiency of Policy Learning for Autonomous Vehicles by Multi-Task Representation Learning0
Data Informed Residual Reinforcement Learning for High-Dimensional Robotic Tracking Control0
Incremental Hierarchical Reinforcement Learning with Multitask LMDPs0
Incrementality Bidding via Reinforcement Learning under Mixed and Delayed Rewards0
Incrementally Learning Functions of the Return0
Incremental Policy Gradients for Online Reinforcement Learning Control0
Incremental Reinforcement Learning --- a New Continuous Reinforcement Learning Frame Based on Stochastic Differential Equation methods0
Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning0
Independent Learning in Stochastic Games0
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence0
Independent Policy Gradient Methods for Competitive Reinforcement Learning0
Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective0
Index Selection for NoSQL Database with Deep Reinforcement Learning0
Individual-Level Inverse Reinforcement Learning for Mean Field Games0
Individual specialization in multi-task environments with multiagent reinforcement learners0
Indoor Point-to-Point Navigation with Deep Reinforcement Learning and Ultra-wideband0
Inducing Cooperation via Learning to reshape rewards in semi-cooperative multi-agent reinforcement learning0
Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning0
Inducing Functions through Reinforcement Learning without Task Specification0
Induction and Exploitation of Subgoal Automata for Reinforcement Learning0
Induction of Subgoal Automata for Reinforcement Learning0
Inductive-bias-driven Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters0
Inductive Bias-driven Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters0
Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing0
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models0
Inference-Time Scaling for Generalist Reward Modeling0
Inferential Induction: A Novel Framework for Bayesian Reinforcement Learning0
Show:102550
← PrevPage 108 of 303Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified