SOTAVerified

Decision Making

Papers

Showing 651700 of 12311 papers

TitleStatusHype
BuildingView: Constructing Urban Building Exteriors Databases with Street View Imagery and Multimodal Large Language ModeCode1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation TransformerCode1
Cal-DETR: Calibrated Detection TransformerCode1
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World ModellingCode1
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systemsCode1
Algorithmic Decision Making with Conditional FairnessCode1
DRIVE: Deep Reinforced Accident Anticipation with Visual ExplanationCode1
A general framework for multi-step ahead adaptive conformal heteroscedastic time series forecastingCode1
Building a Scalable and Interpretable Bayesian Deep Learning Framework for Quality Control of Free Form SurfacesCode1
Calibration of Neural Networks using SplinesCode1
BoWFire: Detection of Fire in Still Images by Integrating Pixel Color and Texture AnalysisCode1
Dual Intent Enhanced Graph Neural Network for Session-based New Item RecommendationCode1
Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 ChallengeCode1
Dynamic planning in hierarchical active inferenceCode1
Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-based Convolutional Neural NetworksCode1
EDGE COVID-19: A Web Platform to generate submission-ready genomes for SARS-CoV-2 sequencing effortsCode1
Binocular Mutual Learning for Improving Few-shot ClassificationCode1
Efficient Planning in a Compact Latent Action SpaceCode1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge SummariesCode1
BIMCaP: BIM-based AI-supported LiDAR-Camera Pose RefinementCode1
EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse DynamicsCode1
EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine ReadingCode1
Emulation of physical processes with EmukitCode1
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop FeedbackCode1
CAMANet: Class Activation Map Guided Attention Network for Radiology Report GenerationCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Beyond Trivial Counterfactual Explanations with Diverse Valuable ExplanationsCode1
AI-Driven Day-to-Day Route ChoiceCode1
Entropy-Regularized Token-Level Policy Optimization for Language Agent ReinforcementCode1
Epidemic Modeling with Generative AgentsCode1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
Ergodicity-breaking reveals time optimal decision making in humansCode1
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for SamplingCode1
Beyond calibration: estimating the grouping loss of modern neural networksCode1
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam SearchCode1
Bias in Multimodal AI: Testbed for Fair Automatic RecruitmentCode1
Explainability of Deep Learning models for Urban Space perceptionCode1
Explainable AI for computational pathology identifies model limitations and tissue biomarkersCode1
Explainable Fuzzy Neural Network with Multi-Fidelity Reinforcement Learning for Micro-Architecture Design Space ExplorationCode1
Explainable Image Similarity: Integrating Siamese Networks and Grad-CAMCode1
Explainable Neural Computation via Stack Neural Module NetworksCode1
Explaining Autonomous Driving Actions with Visual Question AnsweringCode1
Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine ReadingCode1
Extended Tree Search for Robot Task and Motion PlanningCode1
Extracting Reward Functions from Diffusion ModelsCode1
Failure Detection in Medical Image Classification: A Reality Check and Benchmarking TestbedCode1
Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate MistreatmentCode1
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned ApproximationsCode1
Show:102550
← PrevPage 14 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified