SOTAVerified

Decision Making

Papers

Showing 601650 of 12311 papers

TitleStatusHype
Agents Explore the Environment Beyond Good Actions to Improve Their Model for Better DecisionsCode1
Do graph neural networks learn traditional jet substructure?Code1
A Survey of Medical Vision-and-Language Applications and Their TechniquesCode1
A survey on datasets for fairness-aware machine learningCode1
A Survey of World Models for Autonomous DrivingCode1
DORA: Exploring Outlier Representations in Deep Neural NetworksCode1
A Probabilistic Graphical Model Foundation for Enabling Predictive Digital Twins at ScaleCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram DigitizationCode1
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI BenchmarkCode1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation TransformerCode1
DocLens: Multi-aspect Fine-grained Evaluation for Medical Text GenerationCode1
AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative ReasoningCode1
A Probabilistic U-Net for Segmentation of Ambiguous ImagesCode1
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via DebateCode1
A Graph-Based Modeling Framework for Tracing Hydrological Pollutant Transport in Surface WatersCode1
Epidemic Modeling with Generative AgentsCode1
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World ModellingCode1
Attention to Fires: Multi-Channel Deep Learning Models for Wildfire Severity PredictionCode1
DRIVE: Deep Reinforced Accident Anticipation with Visual ExplanationCode1
Distributive Justice as the Foundational Premise of Fair ML: Unification, Extension, and Interpretation of Group Fairness MetricsCode1
Diverse and Admissible Trajectory Forecasting through Multimodal Context UnderstandingCode1
A User's Guide to Calibrating Robotics SimulatorsCode1
Dissecting and Mitigating Diffusion Bias via Mechanistic InterpretabilityCode1
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous DrivingCode1
A general framework for multi-step ahead adaptive conformal heteroscedastic time series forecastingCode1
Distributional GFlowNets with Quantile FlowsCode1
Diverse and Admissible Trajectory Prediction through Multimodal Context UnderstandingCode1
Auto-GPT for Online Decision Making: Benchmarks and Additional OpinionsCode1
Distributional Counterfactual Explanations With Optimal TransportCode1
Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine ReadingCode1
Explainable Deep Learning for Tumor Dynamic Modeling and Overall Survival Prediction using Neural-ODECode1
Discriminative Particle Filter Reinforcement Learning for Complex Partial ObservationsCode1
Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision TreesCode1
DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local ExplanationsCode1
Explainable Neural Computation via Stack Neural Module NetworksCode1
Explaining Autonomous Driving Actions with Visual Question AnsweringCode1
A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and AdaptationCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
Divide and Conquer: Answering Questions with Object Factorization and Compositional ReasoningCode1
A View From Somewhere: Human-Centric Face RepresentationsCode1
Failure Detection in Medical Image Classification: A Reality Check and Benchmarking TestbedCode1
Fair and Optimal Classification via Post-ProcessingCode1
BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous DrivingCode1
Balancing Biases and Preserving Privacy on Balanced Faces in the WildCode1
Fairness in Credit Scoring: Assessment, Implementation and Profit ImplicationsCode1
Fairness in Ranking under UncertaintyCode1
BayesianFitForecast: A User-Friendly R Toolbox for Parameter Estimation and Forecasting with Ordinary Differential EquationsCode1
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of ExplainabilityCode1
Show:102550
← PrevPage 13 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified