SOTAVerified

Decision Making

Papers

Showing 451-460 of 12311 papers

Title | Status | Hype
Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving | Code | 1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym | Code | 1
MEDPSeg: Hierarchical polymorphic multitask learning for the segmentation of ground-glass opacities, consolidation, and pulmonary structures on computed tomography | Code | 1
RiskBench: A Scenario-based Benchmark for Risk Identification | Code | 1
Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations | Code | 1
Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models | Code | 1
Utilizing Explainability Techniques for Reinforcement Learning Model Assurance | Code | 1
VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG | Code | 1
Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents | Code | 1
Labeling Neural Representations with Inverse Recognition | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified