SOTAVerified

Decision Making

Papers

Showing 80018050 of 12311 papers

TitleStatusHype
Evaluating AI-Driven Automated Map Digitization in QGIS0
Evaluating and Aligning Human Economic Risk Preferences in LLMs0
Evaluating and Boosting Uncertainty Quantification in Classification0
Evaluating and Improving Value Judgments in AI: A Scenario-Based Study on Large Language Models' Depiction of Social Conventions0
Evaluating Bayesian Model Visualisations0
Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems0
Evaluating Brain-Inspired Modular Training in Automated Circuit Discovery for Mechanistic Interpretability0
Evaluating Conversational Recommender Systems: A Landscape of Research0
Decictor: Towards Evaluating the Robustness of Decision-Making in Autonomous Driving Systems0
Evaluating Deep Human-in-the-Loop Optimization for Retinal Implants Using Sighted Participants0
Evaluating Dynamic Conditional Quantile Treatment Effects with Applications in Ridesharing0
Evaluating Explanation Methods for Vision-and-Language Navigation0
Evaluating Fair Feature Selection in Machine Learning for Healthcare0
Public Perceptions of Fairness Metrics Across Borders0
Evaluating Fairness Metrics in the Presence of Dataset Bias0
Evaluating Gender Bias of LLMs in Making Morality Judgements0
Partially Observable Markov Decision Process Modelling for Assessing Hierarchies0
Evaluating Human-AI Collaboration: A Review and Methodological Framework0
Evaluating Human Alignment and Model Faithfulness of LLM Rationale0
Evaluating Human-like Explanations for Robot Actions in Reinforcement Learning Scenarios0
Evaluating Interventional Reasoning Capabilities of Large Language Models0
Evaluating Large Language Models in Ophthalmology0
Evaluating Large Language Models through Gender and Racial Stereotypes0
Evaluating LeNet Algorithms in Classification Lung Cancer from Iraq-Oncology Teaching Hospital/National Center for Cancer Diseases0
Evaluating LLMs for Text-to-SQL Generation With Complex SQL Workload0
Evaluating Perceptual Distance Models by Fitting Binomial Distributions to Two-Alternative Forced Choice Data0
Evaluating Reinforcement Learning Algorithms in Observational Health Settings0
Evaluating Scenario-based Decision-making for Interactive Autonomous Driving Using Rational Criteria: A Survey0
Evaluating subgroup disparity using epistemic uncertainty in mammography0
Evaluating Text Classification Robustness to Part-of-Speech Adversarial Examples0
Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare0
Evaluating the Determinants of Mode Choice Using Statistical and Machine Learning Techniques in the Indian Megacity of Bengaluru0
Evaluating the Effectiveness of 2D and 3D Features for Predicting Tumor Response to Chemotherapy0
Evaluating the Explainability of Attributes and Prototypes for a Medical Classification Model0
Evaluating the Explainable AI Method Grad-CAM for Breath Classification on Newborn Time Series Data0
Evaluating the impact of quarantine measures on COVID-19 spread0
Evaluating the Performance of Large Language Models in Scientific Claim Detection and Classification0
Evaluating the Reproducibility of Research in Obstetrics and Gynecology0
Evaluating the Robustness of Bayesian Neural Networks Against Different Types of Attacks0
Evaluating the Safety of Deep Reinforcement Learning Models using Semi-Formal Verification0
Evaluating the Similarity Estimator component of the TWIN Personality-based Recommender System0
Evaluating the Stability of Deep Learning Latent Feature Spaces0
Evaluating the Utility of Conformal Prediction Sets for AI-Advised Image Labeling0
Evaluating the Utility of Model Explanations for Model Development0
Evaluating Uncertainties in Electricity Markets via Machine Learning and Quantum Computing0
Evaluating Uncertainty Estimation Methods on 3D Semantic Segmentation of Point Clouds0
Evaluating Visual Explanations of Attention Maps for Transformer-based Medical Imaging0
Evaluating World Models with LLM for Decision Making0
Evaluation and selection of Medical Tourism sites: A rough AHP based MABAC approach0
Evaluation Mechanism of Collective Intelligence for Heterogeneous Agents Group0
Show:102550
← PrevPage 161 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified