SOTAVerified

Decision Making

Papers

Showing 881890 of 12311 papers

TitleStatusHype
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of ExplainabilityCode1
Predictive Coding for Decision TransformerCode1
Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven OptimizationCode1
Deep Reinforcement Learning with Task-Adaptive Retrieval via HypernetworkCode1
An Introduction to Deep Reinforcement LearningCode1
Emergent Linear Representations in World Models of Self-Supervised Sequence ModelsCode1
Private Prediction SetsCode1
EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine ReadingCode1
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray ImagesCode1
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge SummariesCode1
Show:102550
← PrevPage 89 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified