SOTAVerified

NetHack

Mean in-game score over 1000 episodes with random seeds not seen during training. See https://arxiv.org/abs/2006.13760 (Section 2.4 Evaluation Protocol) for details.

Papers

Showing 110 of 28 papers

TitleStatusHype
PufferLib: Making Reinforcement Learning Libraries and Environments Play NiceCode4
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement LearningCode3
Dungeons and Data: A Large-Scale NetHack DatasetCode2
Syllabus: Portable Curricula for Reinforcement Learning AgentsCode2
diff History for Neural Language AgentsCode1
BeBold: Exploration Beyond the Boundary of Explored RegionsCode1
Hierarchical Kickstarting for Skill Transfer in Reinforcement LearningCode1
Katakomba: Tools and Benchmarks for Data-Driven NetHackCode1
CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual Reinforcement Learning AgentsCode1
LuckyMera: a Modular AI Framework for Building Hybrid NetHack AgentsCode1
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.