SOTAVerified

Decision Making

Papers

Showing 551600 of 12311 papers

TitleStatusHype
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge SummariesCode1
EDGE COVID-19: A Web Platform to generate submission-ready genomes for SARS-CoV-2 sequencing effortsCode1
A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite ImageryCode1
EDITS: Modeling and Mitigating Data Bias for Graph Neural NetworksCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Dynamic Causal Bayesian OptimizationCode1
Dynamic planning in hierarchical active inferenceCode1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted PrescriptionCode1
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray ImagesCode1
Beyond Pixels: Enhancing LIME with Hierarchical Features and Segmentation Foundation ModelsCode1
Driving Style Recognition Using Interval Type-2 Fuzzy Inference System and Multiple Experts Decision MakingCode1
DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared KnowledgeCode1
A Recurrent Vision-and-Language BERT for NavigationCode1
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI BenchmarkCode1
Driving Style Alignment for LLM-powered Driver AgentCode1
Dual Intent Enhanced Graph Neural Network for Session-based New Item RecommendationCode1
Do graph neural networks learn traditional jet substructure?Code1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
Domain Generalization via Rationale InvarianceCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation TransformerCode1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and AdaptationCode1
Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement LearningCode1
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via DebateCode1
DORA: Exploring Outlier Representations in Deep Neural NetworksCode1
End-to-End Conformal Calibration for Optimization Under UncertaintyCode1
Distributional GFlowNets with Quantile FlowsCode1
Dissecting and Mitigating Diffusion Bias via Mechanistic InterpretabilityCode1
A Probabilistic U-Net for Segmentation of Ambiguous ImagesCode1
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World ModellingCode1
A Probabilistic Graphical Model Foundation for Enabling Predictive Digital Twins at ScaleCode1
Discriminative Particle Filter Reinforcement Learning for Complex Partial ObservationsCode1
Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-MakingCode1
Distributional Counterfactual Explanations With Optimal TransportCode1
Distributive Justice as the Foundational Premise of Fair ML: Unification, Extension, and Interpretation of Group Fairness MetricsCode1
DRIVE: Deep Reinforced Accident Anticipation with Visual ExplanationCode1
DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local ExplanationsCode1
Are Human-generated Demonstrations Necessary for In-context Learning?Code1
A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort studyCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
Argumentative Large Language Models for Explainable and Contestable Claim VerificationCode1
Dynaformer: A Deep Learning Model for Ageing-aware Battery Discharge PredictionCode1
Digital Transformation in the Water Distribution System based on the Digital Twins ConceptCode1
ARM-Net: Adaptive Relation Modeling Network for Structured DataCode1
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of ExplainabilityCode1
Scalable Multi-agent Reinforcement Learning Algorithm for Wireless NetworksCode1
Dynamic Sparse Training for Deep Reinforcement LearningCode1
Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-based Convolutional Neural NetworksCode1
Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine ReadingCode1
Diverse and Admissible Trajectory Forecasting through Multimodal Context UnderstandingCode1
Show:102550
← PrevPage 12 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified