SOTAVerified

General Reinforcement Learning

Papers

Showing 150 of 84 papers

TitleStatusHype
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignCode1
Adaptive Rational Activations to Boost Deep Reinforcement LearningCode1
Learning Deformable Object Manipulation from Expert DemonstrationsCode1
Learning Exploration Policies for NavigationCode1
Learning to Incentivize Other Learning AgentsCode1
Action Branching Architectures for Deep Reinforcement LearningCode1
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning AlgorithmCode1
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic FrameworkCode1
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement LearningCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
End-to-End Egospheric Spatial MemoryCode1
Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement LearningCode1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous DrivingCode1
Stabilizing Transformers for Reinforcement LearningCode1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring RotorsCode1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural NetworksCode1
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning ApproachCode1
Time Limits in Reinforcement LearningCode1
Gibson Env: Real-World Perception for Embodied AgentsCode0
Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to RankCode0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationCode0
A Monte Carlo AIXI ApproximationCode0
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement LearningCode0
AIXIjs: A Software Demo for General Reinforcement LearningCode0
Interactive Learning from Activity DescriptionCode0
Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency MapsCode0
Hypercube Policy Regularization Framework for Offline Reinforcement LearningCode0
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing fieldCode0
QKSA: Quantum Knowledge Seeking AgentCode0
Learning to Represent Action Values as a Hypergraph on the Action VerticesCode0
Learning to Backdoor Federated LearningCode0
Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement LearningCode0
Dex: Incremental Learning for Complex Environments in Deep Reinforcement LearningCode0
Generalised Discount Functions applied to a Monte-Carlo AImu ImplementationCode0
The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis0
The Sample-Complexity of General Reinforcement Learning0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making0
Transferring Agent Behaviors from Videos via Motion GANs0
Variational Regret Bounds for Reinforcement Learning0
Abstractions of General Reinforcement Learning0
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning0
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning0
Active Information Acquisition0
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.6Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified