SOTAVerified

General Reinforcement Learning

Papers

Showing 2650 of 84 papers

TitleStatusHype
D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing0
Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to RankCode0
Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions0
Abstractions of General Reinforcement Learning0
Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning ApproachCode1
^2-exploration for Reinforcement Learning0
Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationCode0
A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning0
Low-Resource Machine Translation based on Asynchronous Dynamic Programming0
QKSA: Quantum Knowledge Seeking AgentCode0
Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning0
FaiR-IoT: Fairness-aware Human-in-the-Loop Reinforcement Learning for Harnessing Human Variability in Personalized IoT0
Adaptive Rational Activations to Boost Deep Reinforcement LearningCode1
End-to-End Egospheric Spatial MemoryCode1
Interactive Learning from Activity DescriptionCode0
A State Representation Dueling Network for Deep Reinforcement Learning0
Exact Reduction of Huge Action Spaces in General Reinforcement Learning0
Reinforcement Learning of Causal Variables Using Mediation Analysis0
Learning to Represent Action Values as a Hypergraph on the Action VerticesCode0
Align-RUDDER: Learning From Few Demonstrations by Reward RedistributionCode1
Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring RotorsCode1
Data-Efficient Reinforcement Learning with Self-Predictive RepresentationsCode1
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement LearningCode0
Counterfactual Data Augmentation using Locally Factored DynamicsCode1
Show:102550
← PrevPage 2 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RNBScore7Unverified
2PPOScore5Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore4.8Unverified
2PPOScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.6Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore0.8Unverified
2PPOScore0.6Unverified
#ModelMetricClaimedVerifiedStatus
1PPOScore1.2Unverified
2RNBScore1Unverified
#ModelMetricClaimedVerifiedStatus
1RNBScore3.4Unverified
2PPOScore0.8Unverified