SOTAVerified

Decision Making

Papers

Showing 181190 of 12311 papers

TitleStatusHype
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous AgentsCode2
Cumulative Reasoning with Large Language ModelsCode2
Global birdsong embeddings enable superior transfer learning for bioacoustic classificationCode2
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAXCode2
Adversarial attacks and defenses in explainable artificial intelligence: A surveyCode2
STEVE-1: A Generative Model for Text-to-Behavior in MinecraftCode2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation LearningCode2
Training Diffusion Models with Reinforcement LearningCode2
AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsCode2
Large AI Models in Health Informatics: Applications, Challenges, and the FutureCode2
Show:102550
← PrevPage 19 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified