SOTAVerified

NetHack

Mean in-game score over 1000 episodes with random seeds not seen during training. See https://arxiv.org/abs/2006.13760 (Section 2.4 Evaluation Protocol) for details.

Papers

Showing 2628 of 28 papers

TitleStatusHype
Insights From the NeurIPS 2021 NetHack ChallengeCode0
Improving Policy Learning via Language Dynamics DistillationCode0
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation ProblemCode0
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.