SOTAVerified

General Reinforcement Learning

Papers

Showing 1-50 of 84 papers

Title | Status | Hype
OpenSpiel: A Framework for Reinforcement Learning in Games | Code | 3
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks | Code | 2
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning | Code | 2
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Code | 2
Stabilizing Transformers for Reinforcement Learning | Code | 1
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm | Code | 1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural Networks | Code | 1
Learning to Incentivize Other Learning Agents | Code | 1
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach | Code | 1
Counterfactual Data Augmentation using Locally Factored Dynamics | Code | 1
Time Limits in Reinforcement Learning | Code | 1
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | Code | 1
End-to-End Egospheric Spatial Memory | Code | 1
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Code | 1
Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Code | 1
Adaptive Rational Activations to Boost Deep Reinforcement Learning | Code | 1
Data-Efficient Reinforcement Learning with Self-Predictive Representations | Code | 1
Learning Exploration Policies for Navigation | Code | 1
Learning Deformable Object Manipulation from Expert Demonstrations | Code | 1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving | Code | 1
Action Branching Architectures for Deep Reinforcement Learning | Code | 1
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning | Code | 1
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic Framework | Code | 1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors | Code | 1
Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning | Code | 0
Learning to Backdoor Federated Learning | Code | 0
Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps | Code | 0
Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to Rank | Code | 0
AIXIjs: A Software Demo for General Reinforcement Learning | Code | 0
QKSA: Quantum Knowledge Seeking Agent | Code | 0
Generalised Discount Functions applied to a Monte-Carlo AImu Implementation | Code | 0
Gibson Env: Real-World Perception for Embodied Agents | Code | 0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Code | 0
Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Code | 0
Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning | Code | 0
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning | Code | 0
Interactive Learning from Activity Description | Code | 0
Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field | Code | 0
A Monte Carlo AIXI Approximation | Code | 0
Student/Teacher Advising through Reward Augmentation | | 0
Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning | | 0
The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis | | 0
The Sample-Complexity of General Reinforcement Learning | | 0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | | 0
Transferring Agent Behaviors from Videos via Motion GANs | | 0
Variational Regret Bounds for Reinforcement Learning | | 0
Abstractions of General Reinforcement Learning | | 0
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning | | 0
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning | | 0
Active Information Acquisition | | 0
Page 1 of 2

Benchmark Results

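# | Model | Metric | Claimed | Verified | Status
1 | RNB | Score | 7 | | Unverified
2 | PPO | Score | 5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RNB | Score | 4.8 | | Unverified
2 | PPO | Score | 1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RNB | Score | 0.6 | | Unverified
2 | PPO | Score | 0.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RNB | Score | 0.8 | | Unverified
2 | PPO | Score | 0.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PPO | Score | 1.2 | | Unverified
2 | RNB | Score | 1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RNB | Score | 3.4 | | Unverified
2 | PPO | Score | 0.8 | | Unverified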