SOTAVerified

Decision Making

Papers

Showing 471480 of 12311 papers

TitleStatusHype
Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical DiagnosisCode1
Adaptive Conformal Predictions for Time SeriesCode1
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningCode1
From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language ModelsCode1
From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision MakingCode1
AvalonBench: Evaluating LLMs Playing the Game of AvalonCode1
From Attribution Maps to Human-Understandable Explanations through Concept Relevance PropagationCode1
Frustum-PointPillars: A Multi-Stage Approach for 3D Object Detection using RGB Camera and LiDARCode1
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMsCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Show:102550
← PrevPage 48 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified