SOTAVerified

Decision Making

Papers

Showing 9511000 of 12311 papers

TitleStatusHype
A Recurrent Vision-and-Language BERT for NavigationCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
Dynamic Sparse Training for Deep Reinforcement LearningCode1
SocialVAE: Human Trajectory Prediction using Timewise LatentsCode1
Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-based Convolutional Neural NetworksCode1
SSL-SoilNet: A Hybrid Transformer-based Framework with Self-Supervised Learning for Large-scale Soil Organic Carbon PredictionCode1
SOTIF Entropy: Online SOTIF Risk Quantification and Mitigation for Autonomous DrivingCode1
SPACE: A Python-based Simulator for Evaluating Decentralized Multi-Robot Task Allocation AlgorithmsCode1
Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate MistreatmentCode1
Efficient Planning in a Compact Latent Action SpaceCode1
Fairness Through Robustness: Investigating Robustness Disparity in Deep LearningCode1
Ergodicity-breaking reveals time optimal decision making in humansCode1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
STeCa: Step-level Trajectory Calibration for LLM Agent LearningCode1
Fibrosis-Net: A Tailored Deep Convolutional Neural Network Design for Prediction of Pulmonary Fibrosis Progression from Chest CT ImagesCode1
Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine ReadingCode1
Adversarial Attacks on Probabilistic Autoregressive Forecasting ModelsCode1
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning RateCode1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
Structured Scene Memory for Vision-Language NavigationCode1
An Empirical Characterization of Fair Machine Learning For Clinical Risk PredictionCode1
An empirical evaluation of active inference in multi-armed banditsCode1
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4Code1
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray ImagesCode1
Extended Tree Search for Robot Task and Motion PlanningCode1
EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine ReadingCode1
Explaining generative diffusion models via visual analysis for interpretable decision-making processCode1
Emergent Coordination through Game-Induced Nonlinear Opinion DynamicsCode1
EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse DynamicsCode1
An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet SpaceCode1
Explaining Autonomous Driving Actions with Visual Question AnsweringCode1
Tactical Decision-Making in Autonomous Driving by Reinforcement Learning with Uncertainty EstimationCode1
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning ApproachCode1
Emulation of physical processes with EmukitCode1
TAPAS: a Toolbox for Adversarial Privacy Auditing of Synthetic DataCode1
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis AgentsCode1
Adversarial Robustness of Representation Learning for Knowledge GraphsCode1
Engineering flexible machine learning systems by traversing functionally-invariant pathsCode1
t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision MakingCode1
TE2Rules: Explaining Tree Ensembles using RulesCode1
Explaining machine-learned particle-flow reconstructionCode1
Extending CAM-based XAI methods for Remote Sensing Imagery SegmentationCode1
Equitable Restless Multi-Armed Bandits: A General Framework Inspired By Digital HealthCode1
Explainable Image Similarity: Integrating Siamese Networks and Grad-CAMCode1
AdViCE: Aggregated Visual Counterfactual Explanations for Machine Learning Model ValidationCode1
Explainable Fuzzy Neural Network with Multi-Fidelity Reinforcement Learning for Micro-Architecture Design Space ExplorationCode1
Large Language Models are Learnable Planners for Long-Term RecommendationCode1
TFN: An Interpretable Neural Network with Time-Frequency Transform Embedded for Intelligent Fault DiagnosisCode1
The Grammar of Interactive Explanatory Model AnalysisCode1
Explainable Machine Larning for liver transplantationCode1
Show:102550
← PrevPage 20 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified