SOTAVerified

Decision Making

Papers

Showing 331-340 of 12311 papers

Title | Status | Hype
Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation | Code | 1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos | Code | 1
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains | Code | 1
Can Learned Optimization Make Reinforcement Learning Less Difficult? | Code | 1
Integrating Clinical Knowledge into Concept Bottleneck Models | Code | 1
A Mamba-based Siamese Network for Remote Sensing Change Detection | Code | 1
Language Model Alignment in Multilingual Trolley Problems | Code | 1
PUZZLES: A Benchmark for Neural Algorithmic Reasoning | Code | 1
Evidential Concept Embedding Models: Towards Reliable Concept Explanations for Skin Disease Diagnosis | Code | 1
CELLO: Causal Evaluation of Large Vision-Language Models | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified