SOTAVerified

Decision Making

Papers

Showing 501550 of 12311 papers

TitleStatusHype
Adaptive Conformal Predictions for Time SeriesCode1
Do graph neural networks learn traditional jet substructure?Code1
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principlesCode1
iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement LearningCode1
Aequitas: A Bias and Fairness Audit ToolkitCode1
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI BenchmarkCode1
Accuracy and Fairness Trade-offs in Machine Learning: A Stochastic Multi-Objective ApproachCode1
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy OptimizationCode1
Argumentative Large Language Models for Explainable and Contestable Claim VerificationCode1
ARM-Net: Adaptive Relation Modeling Network for Structured DataCode1
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World ModellingCode1
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via DebateCode1
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information AssistantCode1
Are Human-generated Demonstrations Necessary for In-context Learning?Code1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation TransformerCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
Beyond Pixels: Enhancing LIME with Hierarchical Features and Segmentation Foundation ModelsCode1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language ModelsCode1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and AdaptationCode1
Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision TreesCode1
Distributional GFlowNets with Quantile FlowsCode1
A Probabilistic Graphical Model Foundation for Enabling Predictive Digital Twins at ScaleCode1
A Probabilistic U-Net for Segmentation of Ambiguous ImagesCode1
ADaPT: As-Needed Decomposition and Planning with Language ModelsCode1
Discriminative Particle Filter Reinforcement Learning for Complex Partial ObservationsCode1
AdaPlanner: Adaptive Planning from Feedback with Language ModelsCode1
Distributional Counterfactual Explanations With Optimal TransportCode1
Dissecting and Mitigating Diffusion Bias via Mechanistic InterpretabilityCode1
Distributive Justice as the Foundational Premise of Fair ML: Unification, Extension, and Interpretation of Group Fairness MetricsCode1
DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local ExplanationsCode1
Digital Transformation in the Water Distribution System based on the Digital Twins ConceptCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising Diffusion ModelsCode1
DiffPO: A causal diffusion model for learning distributions of potential outcomesCode1
A Recurrent Vision-and-Language BERT for NavigationCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
DiffLoad: Uncertainty Quantification in Electrical Load Forecasting with the Diffusion ModelCode1
Diffusion-Based Electrocardiography Noise Quantification via Anomaly DetectionCode1
Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine ReadingCode1
Diverse and Admissible Trajectory Forecasting through Multimodal Context UnderstandingCode1
A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort studyCode1
Diagnosing Infeasible Optimization Problems Using Large Language ModelsCode1
DiffAIL: Diffusion Adversarial Imitation LearningCode1
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of ExplainabilityCode1
DexArt: Benchmarking Generalizable Dexterous Manipulation with Articulated ObjectsCode1
DG-Trans: Dual-level Graph Transformer for Spatiotemporal Incident Impact Prediction on Traffic NetworksCode1
Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in ChessCode1
Developing Optimal Causal Cyber-Defence Agents via Cyber Security SimulationCode1
Dense Uncertainty EstimationCode1
Show:102550
← PrevPage 11 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified