SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) trains an agent to take actions in an environment so as to maximize a cumulative reward signal. The agent interacts with the environment and learns from the feedback it receives, in the form of rewards or penalties, for its actions. The goal is to find the optimal policy, that is, the decision-making strategy that maximizes long-term reward.
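The interaction loop described above can be sketched with a toy example. The snippet below is only an illustration, not a method taken from any paper listed on this page: it assumes a hypothetical five-cell corridor environment (`step`), a reward of 1 for reaching the last cell, and tabular Q-learning as one of many possible learning rules. All names and hyperparameters (`alpha`, `gamma`, `epsilon`) are assumptions chosen for the sketch.

```python
import random

N_STATES = 5          # hypothetical corridor: cells 0..4, cell 4 is the goal
ACTIONS = (0, 1)      # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment (an assumption): returns (next_state, reward, done)."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy action selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=300, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning: learn action values from reward feedback."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: immediate reward plus discounted future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal cell."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
```

In this sketch the learned policy is simply the action with the highest estimated value in each cell; the discount factor `gamma` is what makes the agent prefer reaching the reward sooner rather than later.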

Papers

Showing 14501-14550 of 15113 papers

Title | Status | Hype
Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning | Code | 0
Efficient Information Diffusion in Time-Varying Graphs through Deep Reinforcement Learning | Code | 0
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management | Code | 0
Learning Where to Sample in Structured Prediction | Code | 0
Carle's Game: An Open-Ended Challenge in Exploratory Machine Creativity | Code | 0
Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck | Code | 0
Generalization in Text-based Games via Hierarchical Reinforcement Learning | Code | 0
Dealing with uncertainty: balancing exploration and exploitation in deep recurrent reinforcement learning | Code | 0
Generalization in Visual Reinforcement Learning with the Reward Sequence Distribution | Code | 0
A Biologically Plausible Learning Rule for Deep Learning in the Brain | Code | 0
Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models | Code | 0
A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning | Code | 0
Impartial Games: A Challenge for Reinforcement Learning | Code | 0
Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation | Code | 0
Generalization through Simulation: Integrating Simulated and Real Data into Deep Reinforcement Learning for Vision-Based Autonomous Flight | Code | 0
Learning to Optimize Variational Quantum Circuits to Solve Combinatorial Problems | Code | 0
Can maker-taker fees prevent algorithmic cooperation in market making? | Code | 0
Decentralized Computation Offloading for Multi-User Mobile Edge Computing: A Deep Reinforcement Learning Approach | Code | 0
Generalization Tower Network: A Novel Deep Neural Network Architecture for Multi-Task Learning | Code | 0
Efficient Model-free Reinforcement Learning in Metric Spaces | Code | 0
Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? | Code | 0
Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management | Code | 0
Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Code | 0
Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via Reinforcement Learning | Code | 0
Efficient Object Detection in Large Images using Deep Reinforcement Learning | Code | 0
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables | Code | 0
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning | Code | 0
Decaying Clipping Range in Proximal Policy Optimization | Code | 0
DEAR: Disentangled Environment and Agent Representations for Reinforcement Learning without Reconstruction | Code | 0
Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories | Code | 0
Assistive Teaching of Motor Control Tasks to Humans | Code | 0
Efficient Parallel Methods for Deep Reinforcement Learning | Code | 0
A Comparison of Reward Functions in Q-Learning Applied to a Cart Position Problem | Code | 0
Learning Bellman Complete Representations for Offline Policy Evaluation | Code | 0
Agent-State Construction with Auxiliary Inputs | Code | 0
Generalized Phase Pressure Control Enhanced Reinforcement Learning for Traffic Signal Control | Code | 0
Assessing the Potential of Classical Q-learning in General Game Playing | Code | 0
Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning | Code | 0
AgentForge: A Flexible Low-Code Platform for Reinforcement Learning Agent Design | Code | 0
Dealing with Sparse Rewards in Reinforcement Learning | Code | 0
Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning | Code | 0
DDxT: Deep Generative Transformer Models for Differential Diagnosis | Code | 0
Assessing Generalization in Deep Reinforcement Learning | Code | 0
Learning to Perceive in Deep Model-Free Reinforcement Learning | Code | 0
Efficient reinforcement learning control for continuum robots based on Inexplicit Prior Knowledge | Code | 0
Generalized Speedy Q-learning | Code | 0
Can Agents Learn by Analogy? An Inferable Model for PAC Reinforcement Learning | Code | 0
Calibrated Model-Based Deep Reinforcement Learning | Code | 0
Efficient Reinforcement Learning for Jumping Monopods | Code | 0
A Genetic Fuzzy System for Interpretable and Parsimonious Reinforcement Learning Policies | Code | 0
Page 291 of 303

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PPG | Mean Normalized Performance | 0.76 | - | Unverified
2 | PPO | Mean Normalized Performance | 0.58 | - | Unverified