SOTAVerified

Decision Making

Papers

Showing 751800 of 12311 papers

TitleStatusHype
IdentiFace : A VGG Based Multimodal Facial Biometric SystemCode1
Rejecting Hallucinated State Targets during PlanningCode1
ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical ImagesCode1
Algorithmic Recourse: from Counterfactual Explanations to InterventionsCode1
BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical AnalysisCode1
Improving Recommendation Fairness via Data AugmentationCode1
Goal-directed graph construction using reinforcement learningCode1
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-ThoughtCode1
Addressing cognitive bias in medical language modelsCode1
Injecting Planning-Awareness into Prediction and Detection EvaluationCode1
Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer ClassificationCode1
Integrating Clinical Knowledge into Concept Bottleneck ModelsCode1
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise DatasetsCode1
Interpretable by Design: Learning Predictors by Composing Interpretable QueriesCode1
Interpretable Recommender System With Heterogeneous Information: A Geometric Deep Learning PerspectiveCode1
Interpretable statistical representations of neural population dynamics and geometryCode1
BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous DrivingCode1
BayesianFitForecast: A User-Friendly R Toolbox for Parameter Estimation and Forecasting with Ordinary Differential EquationsCode1
Bayesian Safety Validation for Failure Probability Estimation of Black-Box SystemsCode1
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy OptimizationCode1
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned ApproximationsCode1
JEDAI: A System for Skill-Aligned Explainable Robot PlanningCode1
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain FeedbackCode1
Jump to Conclusions: Short-Cutting Transformers With Linear TransformationsCode1
LaMPP: Language Models as Probabilistic Priors for Perception and ActionCode1
Lane Change Classification and Prediction with Action Recognition NetworksCode1
Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive RecommendationCode1
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on GraphsCode1
Active Inference and Behavior Trees for Reactive Action Planning and Execution in RoboticsCode1
Active Fire Detection in Landsat-8 Imagery: a Large-Scale Dataset and a Deep-Learning StudyCode1
ALMA: Hierarchical Learning for Composite Multi-Agent TasksCode1
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient ReasoningCode1
Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement LearningCode1
An algorithmic framework for synthetic cost-aware decision making in molecular designCode1
LLM-SAP: Large Language Models Situational Awareness Based PlanningCode1
Large-scale moral machine experiment on large language modelsCode1
auto-sktime: Automated Time Series ForecastingCode1
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent ConversationCode1
Auto-GPT for Online Decision Making: Benchmarks and Additional OpinionsCode1
Sparse learned kernels for interpretable and efficient medical time series processingCode1
Learning hierarchical behavior and motion planning for autonomous drivingCode1
Learning High-Level Policies for Model Predictive ControlCode1
Learning Multi-Level Hierarchies with HindsightCode1
Learning non-stationary Langevin dynamics from stochastic observations of latent trajectoriesCode1
ALYMPICS: LLM Agents Meet Game Theory -- Exploring Strategic Decision-Making with AI AgentsCode1
Learning Robust Rewards with Adversarial Inverse Reinforcement LearningCode1
Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-MakingCode1
Learning to Model Opponent LearningCode1
A Mamba-based Siamese Network for Remote Sensing Change DetectionCode1
Sample Efficient Reinforcement Learning via Large Vision Language Model DistillationCode1
Show:102550
← PrevPage 16 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified