SOTAVerified

Decision Making

Papers

Showing 461470 of 12311 papers

TitleStatusHype
Large Language Model as a Policy Teacher for Training Reinforcement Learning AgentsCode1
From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language ModelsCode1
Inherently Interpretable Time Series Classification via Multiple Instance LearningCode1
DocLens: Multi-aspect Fine-grained Evaluation for Medical Text GenerationCode1
ToolTalk: Evaluating Tool-Usage in a Conversational SettingCode1
XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMsCode1
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question AnsweringCode1
Real-Time Machine-Learning-Based Optimization Using Input Convex Long Short-Term Memory NetworkCode1
Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization RegimeCode1
MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable UncertaintyCode1
Show:102550
← PrevPage 47 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified