SOTAVerified

Decision Making

Papers

Showing 26812690 of 12311 papers

TitleStatusHype
CoXQL: A Dataset for Parsing Explanation Requests in Conversational XAI SystemsCode0
Asymptotically Optimal Regret for Black-Box Predict-then-Optimize0
Are Objective Explanatory Evaluation metrics Trustworthy? An Adversarial Analysis0
Learning positional encodings in transformers depends on initialization0
Bridging the Gap: Unravelling Local Government Data Sharing Barriers in Estonia and Beyond0
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning0
LVBench: An Extreme Long Video Understanding BenchmarkCode2
Large Language Model-empowered multimodal strain sensory system for shape recognition, monitoring, and human interaction of tensegrity0
"It answers questions that I didn't know I had": Ph.D. Students' Evaluation of an Information Sharing Knowledge Graph0
Test-Time Fairness and Robustness in Large Language Models0
Show:102550
← PrevPage 269 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified