SOTAVerified

Decision Making

Papers

Showing 1251–1300 of 12311 papers

Title | Status | Hype
Leveraging AI for Automatic Classification of PCOS Using Ultrasound Imaging | Code | 0
Joint Scoring Rules: Zero-Sum Competition Avoids Performative Prediction | - | 0
Learning Epidemiological Dynamics via the Finite Expression Method | - | 0
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation | - | 0
Reconciling Privacy and Explainability in High-Stakes: A Systematic Inquiry | Code | 0
Plancraft: an evaluation dataset for planning with LLM agents | Code | 1
A Comprehensive Framework for Reliable Legal AI: Combining Specialized Expert Systems and Adaptive Refinement | - | 0
A fuzzy rank-based ensemble of CNN models for MRI segmentation | Code | 0
Leveraging Edge Intelligence and LLMs to Advance 6G-Enabled Internet of Automated Defense Vehicles | - | 0
Towards influence centrality: where to not add an edge in the network? | - | 0
Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception | - | 0
MAFT: Efficient Model-Agnostic Fairness Testing for Deep Neural Networks via Zero-Order Gradient Search | - | 0
Quantiles under ambiguity and risk sharing | - | 0
Hidformer: Transformer-Style Neural Network in Stock Price Forecasting | - | 0
Uncertainty quantification for improving radiomic-based models in radiation pneumonitis prediction | - | 0
A Review on the Integration of Artificial Intelligence and Medical Imaging in IVF Ovarian Stimulation | - | 0
Disparate Model Performance and Stability in Machine Learning Clinical Support for Diabetes and Heart Diseases | - | 0
Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | - | 0
Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation | Code | 1
A theory of appropriateness with applications to generative artificial intelligence | - | 0
Data clustering: an essential technique in data science | - | 0
Bridging Interpretability and Robustness Using LIME-Guided Model Refinement | - | 0
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning | Code | 1
A Novel Task-Driven Method with Evolvable Interactive Agents Using Event Trees for Enhanced Emergency Decision Support | - | 0
Advancing Explainability in Neural Machine Translation: Analytical Metrics for Attention and Alignment Consistency | - | 0
Uncertainty Quantification in Stereo Matching | Code | 0
Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) -- a Large Language Model Chatbot for Perioperative Medicine | - | 0
A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market | Code | 0
Bayesian Optimization of Bilevel Problems | - | 0
Accelerating process control and optimization via machine learning: A review | - | 0
Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice | - | 0
MineStudio: A Streamlined Package for Minecraft AI Agent Development | Code | 3
GeneSUM: Large Language Model-based Gene Summary Extraction | - | 0
BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing | Code | 0
Quantum framework for Reinforcement Learning: Integrating Markov decision process, quantum arithmetic, and trajectory search | - | 0
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent | - | 0
An Instrumental Value for Data Production and its Application to Data Pricing | - | 0
An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving | - | 0
Multimodal Learning with Uncertainty Quantification based on Discounted Belief Fusion | Code | 1
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models | Code | 1
Explainability in Neural Networks for Natural Language Processing Tasks | - | 0
A Dual-Perspective Metaphor Detection Framework Using Large Language Models | Code | 0
LegalAgentBench: Evaluating LLM Agents in Legal Domain | Code | 1
EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities | Code | 0
Enhancing Cancer Diagnosis with Explainable & Trustworthy Deep Learning Models | - | 0
The Role of XAI in Transforming Aeronautics and Aerospace Systems | - | 0
MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models | - | 0
Fairness in Reinforcement Learning with Bisimulation Metrics | - | 0
Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration | Code | 0
Decentralized Governance of Autonomous AI Agents | - | 0
Page 26 of 247

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified