SOTAVerified

Decision Making

Papers

Showing 1276–1300 of 12311 papers

Title | Status | Hype
Uncertainty Quantification in Stereo Matching | Code | 0
Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) -- a Large Language Model Chatbot for Perioperative Medicine |  | 0
A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market | Code | 0
Bayesian Optimization of Bilevel Problems |  | 0
Accelerating process control and optimization via machine learning: A review |  | 0
Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice |  | 0
MineStudio: A Streamlined Package for Minecraft AI Agent Development | Code | 3
GeneSUM: Large Language Model-based Gene Summary Extraction |  | 0
BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing | Code | 0
Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search |  | 0
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent |  | 0
An Instrumental Value for Data Production and its Application to Data Pricing |  | 0
An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving |  | 0
Multimodal Learning with Uncertainty Quantification based on Discounted Belief Fusion | Code | 1
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models | Code | 1
Explainability in Neural Networks for Natural Language Processing Tasks |  | 0
A Dual-Perspective Metaphor Detection Framework Using Large Language Models | Code | 0
LegalAgentBench: Evaluating LLM Agents in Legal Domain | Code | 1
EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities | Code | 0
Enhancing Cancer Diagnosis with Explainable & Trustworthy Deep Learning Models |  | 0
The Role of XAI in Transforming Aeronautics and Aerospace Systems |  | 0
MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models |  | 0
Fairness in Reinforcement Learning with Bisimulation Metrics |  | 0
Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration | Code | 0
Decentralized Governance of Autonomous AI Agents |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 |  | Unverified