SOTAVerified

Decision Making

Papers

Showing 17711780 of 12311 papers

TitleStatusHype
Reflection-Bench: probing AI intelligence with reflectionCode1
Fine-Tuning LLMs for Reliable Medical Question-Answering Services0
SceneGraMMi: Scene Graph-boosted Hybrid-fusion for Multi-Modal Misinformation Veracity Prediction0
TAGExplainer: Narrating Graph Explanations for Text-Attributed Graph Learning Models0
Conditional Uncertainty Quantification for Tensorized Topological Neural Networks0
Learning-Augmented Algorithms for the Bahncard Problem0
Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Experiments, and Challenges0
A Comprehensive Evaluation of Cognitive Biases in LLMsCode1
Economic Anthropology in the Era of Generative Artificial Intelligence0
Who is Undercover? Guiding LLMs to Explore Multi-Perspective Team Tactic in the Game0
Show:102550
← PrevPage 178 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified