SOTAVerified

Decision Making

Papers

Showing 551-600 of 12311 papers

Title | Status | Hype
Uncertainty Quantification for Molecular Property Predictions with Graph Neural Architecture Search | Code | 1
Interpreting and Correcting Medical Image Classification with PIP-Net | Code | 1
Explaining Autonomous Driving Actions with Visual Question Answering | Code | 1
Fast model inference and training on-board of Satellites | Code | 1
Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training | Code | 1
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View | Code | 1
Epidemic Modeling with Generative Agents | Code | 1
Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation | Code | 1
Sparse learned kernels for interpretable and efficient medical time series processing | Code | 1
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation | Code | 1
Causal Discovery with Language Models as Imperfect Experts | Code | 1
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram Digitization | Code | 1
Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale | Code | 1
Towards Safe Autonomous Driving Policies using a Neuro-Symbolic Deep Reinforcement Learning Approach | Code | 1
Computationally Assisted Quality Control for Public Health Data Streams | Code | 1
milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing | Code | 1
Collective Intelligence in Human-AI Teams A Bayesian Theory of Mind Approach | Code | 1
Interaction-Aware Planning With Deep Inverse Reinforcement Learning for Human-Like Autonomous Driving in Merge Scenarios | Code | 1
TrustGuard: GNN-based Robust and Explainable Trust Evaluation with Dynamicity Support | Code | 1
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs | Code | 1
Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers | Code | 1
Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent | Code | 1
CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning | Code | 1
Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork | Code | 1
Evaluating Superhuman Models with Consistency Checks | Code | 1
Simplified Temporal Consistency Reinforcement Learning | Code | 1
ChessGPT: Bridging Policy Learning and Language Modeling | Code | 1
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control | Code | 1
iPLAN: Intent-Aware Planning in Heterogeneous Traffic via Distributed Multi-Agent Reinforcement Learning | Code | 1
Weight Freezing: A Regularization Approach for Fully Connected Layers with an Application in EEG Classification | Code | 1
Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Code | 1
Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex) | Code | 1
Turning large language models into cognitive models | Code | 1
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Code | 1
Agents Explore the Environment Beyond Good Actions to Improve Their Model for Better Decisions | Code | 1
A Large-Scale Study of Probabilistic Calibration in Neural Network Regression | Code | 1
A Study of Situational Reasoning for Traffic Understanding | Code | 1
Equity-Transformer: Solving NP-hard Min-Max Routing Problems as Sequential Generation with Equity Context | Code | 1
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions | Code | 1
Extracting Reward Functions from Diffusion Models | Code | 1
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations | Code | 1
DiffLoad: Uncertainty Quantification in Electrical Load Forecasting with the Diffusion Model | Code | 1
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain Feedback | Code | 1
Future-conditioned Unsupervised Pretraining for Decision Transformer | Code | 1
AdaPlanner: Adaptive Planning from Feedback with Language Models | Code | 1
Market Making with Deep Reinforcement Learning from Limit Order Books | Code | 1
Koopman Kernel Regression | Code | 1
TransWorldNG: Traffic Simulation via Foundation Model | Code | 1
Think Before You Act: Decision Transformers with Working Memory | Code | 1
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principles | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified