SOTAVerified

General Reinforcement Learning

Papers

Showing 150 of 84 papers

TitleStatusHype
OpenSpiel: A Framework for Reinforcement Learning in GamesCode3
Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement LearningCode2
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control TasksCode2
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language ModelsCode2
NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement LearningCode1
Discovering General Reinforcement Learning Algorithms with Adversarial Environment DesignCode1
DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous DrivingCode1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural NetworksCode1
Learning Deformable Object Manipulation from Expert DemonstrationsCode1
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning ApproachCode1
Adaptive Rational Activations to Boost Deep Reinforcement LearningCode1
End-to-End Egospheric Spatial MemoryCode1
Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionCode1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring RotorsCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement LearningCode1
Learning to Incentivize Other Learning AgentsCode1
Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic FrameworkCode1
Stabilizing Transformers for Reinforcement LearningCode1
Learning Exploration Policies for NavigationCode1
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning AlgorithmCode1
Time Limits in Reinforcement LearningCode1
Action Branching Architectures for Deep Reinforcement LearningCode1
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning0
High-order Regularization for Machine Learning and Learning-based Control0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making0
The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis0
Hypercube Policy Regularization Framework for Offline Reinforcement LearningCode0
Reinforcement Learning: Tutorial and Survey0
Dynamic Knowledge Injection for AIXI Agents0
Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods0
Image Transformation Sequence Retrieval with General Reinforcement Learning0
L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning0
Computably Continuous Reinforcement-Learning Objectives are PAC-learnable0
Policy Mirror Descent Inherently Explores Action Space0
Learning to Backdoor Federated LearningCode0
Computational Dualism and Objective Superintelligence0
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning0
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning0
Computable Artificial General Intelligence0
D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing0
Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to RankCode0
Abstractions of General Reinforcement Learning0
Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions0
Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning0
^2-exploration for Reinforcement Learning0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationCode0
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning0
Low-Resource Machine Translation based on Asynchronous Dynamic Programming0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore0.6Unverified
2RNBScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified