SOTAVerified

General Reinforcement Learning

Papers

Showing 150 of 84 papers

TitleStatusHype
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
Time Limits in Reinforcement LearningCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
End-to-End Egospheric Spatial MemoryCode1
Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement LearningCode1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous DrivingCode1
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement LearningCode1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring RotorsCode1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural NetworksCode1
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning ApproachCode1
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning AlgorithmCode1
Learning to Incentivize Other Learning AgentsCode1
Stabilizing Transformers for Reinforcement LearningCode1
Learning Deformable Object Manipulation from Expert DemonstrationsCode1
Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignCode1
Learning Exploration Policies for NavigationCode1
Adaptive Rational Activations to Boost Deep Reinforcement LearningCode1
Action Branching Architectures for Deep Reinforcement LearningCode1
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic FrameworkCode1
Variational Regret Bounds for Reinforcement Learning0
Abstractions of General Reinforcement Learning0
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning0
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning0
Active Information Acquisition0
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning0
A State Representation Dueling Network for Deep Reinforcement Learning0
Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics0
Reinforcement Learning of Causal Variables Using Mediation Analysis0
Computable Artificial General Intelligence0
Computably Continuous Reinforcement-Learning Objectives are PAC-learnable0
D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing0
Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning0
Differential Temporal Difference Learning0
Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods0
Dynamic Knowledge Injection for AIXI Agents0
Computational Dualism and Objective Superintelligence0
Exact Reduction of Huge Action Spaces in General Reinforcement Learning0
FaiR-IoT: Fairness-aware Human-in-the-Loop Reinforcement Learning for Harnessing Human Variability in Personalized IoT0
Goal-Driven Sequential Data Abstraction0
High-order Regularization for Machine Learning and Learning-based Control0
Image Transformation Sequence Retrieval with General Reinforcement Learning0
Integrating Reinforcement Learning to Self Training for Pulmonary Nodule Segmentation in Chest X-rays0
Learning as Reinforcement: Applying Principles of Neuroscience for More General Reinforcement Learning Agents0
Nonparametric General Reinforcement Learning0
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore0.6Unverified
2RNBScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified