SOTAVerified

Decision Making

Papers

Showing 401-450 of 12311 papers

Title | Status | Hype
A deep active learning system for species identification and counting in camera trap images | Code | 1
Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading | Code | 1
Collaborative Decision Making Using Action Suggestions | Code | 1
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability | Code | 1
ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration | Code | 1
Distributive Justice as the Foundational Premise of Fair ML: Unification, Extension, and Interpretation of Group Fairness Metrics | Code | 1
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values | Code | 1
Cognitive Accident Prediction in Driving Scenes: A Multimodality Benchmark | Code | 1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer | Code | 1
6GAN: IPv6 Multi-Pattern Target Generation via Generative Adversarial Nets with Reinforcement Learning | Code | 1
Do graph neural networks learn traditional jet substructure? | Code | 1
Domain Generalization via Rationale Invariance | Code | 1
A Comparative Visual Analytics Framework for Evaluating Evolutionary Processes in Multi-objective Optimization | Code | 1
Adversarial Robustness of Representation Learning for Knowledge Graphs | Code | 1
Collective Intelligence in Human-AI Teams A Bayesian Theory of Mind Approach | Code | 1
ComTraQ-MPC: Meta-Trained DQN-MPC Integration for Trajectory Tracking with Limited Active Localization Updates | Code | 1
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies | Code | 1
Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Code | 1
A Benchmark on Uncertainty Quantification for Deep Learning Prognostics | Code | 1
Clinically-Inspired Multi-Agent Transformers for Disease Trajectory Forecasting from Multimodal Data | Code | 1
A Comprehensive Evaluation of Cognitive Biases in LLMs | Code | 1
Dynamic Causal Bayesian Optimization | Code | 1
CoCoG: Controllable Visual Stimuli Generation based on Human Concept Representations | Code | 1
Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-based Convolutional Neural Networks | Code | 1
EDITS: Modeling and Mitigating Data Bias for Graph Neural Networks | Code | 1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Code | 1
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer | Code | 1
GLAMOUR: Graph Learning over Macromolecule Representations | Code | 1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification | Code | 1
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries | Code | 1
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback | Code | 1
Emergent Linear Representations in World Models of Self-Supervised Sequence Models | Code | 1
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models | Code | 1
EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading | Code | 1
ChessGPT: Bridging Policy Learning and Language Modeling | Code | 1
Engineering flexible machine learning systems by traversing functionally-invariant paths | Code | 1
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principles | Code | 1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Code | 1
A Divergence Minimization Perspective on Imitation Learning Methods | Code | 1
Ensemble Quantile Networks: Uncertainty-Aware Reinforcement Learning with Applications in Autonomous Driving | Code | 1
Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Code | 1
Epidemic Modeling with Generative Agents | Code | 1
Aequitas: A Bias and Fairness Audit Toolkit | Code | 1
Ergodicity-breaking reveals time optimal decision making in humans | Code | 1
Certified Reinforcement Learning with Logic Guidance | Code | 1
CFGPT: Chinese Financial Assistant with Large Language Model | Code | 1
Causal thinking for decision making on Electronic Health Records: why and how | Code | 1
Explainable AI for computational pathology identifies model limitations and tissue biomarkers | Code | 1
Explainable Deep Learning for Tumor Dynamic Modeling and Overall Survival Prediction using Neural-ODE | Code | 1
CELLO: Causal Evaluation of Large Vision-Language Models | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified