SOTAVerified

Decision Making

Papers

Showing 851-875 of 12311 papers

Title | Status | Hype
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Code | 1
ENTMOOT: A Framework for Optimization over Ensemble Tree Models | Code | 1
Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Code | 1
ORGANA: A Robotic Assistant for Automated Chemistry Experimentation and Characterization | Code | 1
A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort study | Code | 1
Emergent Linear Representations in World Models of Self-Supervised Sequence Models | Code | 1
EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics | Code | 1
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning | Code | 1
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of Explainability | Code | 1
Pareto Set Learning for Expensive Multi-Objective Optimization | Code | 1
Emergent Coordination through Game-Induced Nonlinear Opinion Dynamics | Code | 1
Deep Attentive Learning for Stock Movement Prediction From Social Media Text and Company Correlations | Code | 1
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models | Code | 1
An Introduction to Deep Reinforcement Learning | Code | 1
Deep Reinforcement Learning for Entity Alignment | Code | 1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos | Code | 1
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries | Code | 1
Deep Learning-based Frozen Section to FFPE Translation | Code | 1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification | Code | 1
PFL-MoE: Personalized Federated Learning Based on Mixture of Experts | Code | 1
EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images | Code | 1
EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading | Code | 1
Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Code | 1
Plancraft: an evaluation dataset for planning with LLM agents | Code | 1
EDITS: Modeling and Mitigating Data Bias for Graph Neural Networks | Code | 1
Page 35 of 493

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified