SOTAVerified

Decision Making

Papers

Showing 351375 of 12311 papers

TitleStatusHype
G-Transformer for Conditional Average Potential Outcome Estimation over TimeCode1
Pursuing Overall Welfare in Federated Learning through Sequential Decision MakingCode1
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-ThoughtCode1
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital TwinsCode1
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement LearningCode1
Rethinking Transformers in Solving POMDPsCode1
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-MakingCode1
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning RateCode1
Improving Single Domain-Generalized Object Detection: A Focus on Diversification and AlignmentCode1
PATE: Proximity-Aware Time series anomaly EvaluationCode1
SemEval-2024 Task 3: Multimodal Emotion Cause Analysis in ConversationsCode1
Movie Revenue Prediction using Machine Learning ModelsCode1
Conformal Alignment: Knowing When to Trust Foundation Models with GuaranteesCode1
FedGCS: A Generative Framework for Efficient Client Selection in Federated Learning via Gradient-based OptimizationCode1
Weakly-Supervised Residual Evidential Learning for Multi-Instance Uncertainty EstimationCode1
Argumentative Large Language Models for Explainable and Contestable Claim VerificationCode1
UCB-driven Utility Function Search for Multi-objective Reinforcement LearningCode1
CoSense3D: an Agent-based Efficient Learning Framework for Collective PerceptionCode1
CoCoG: Controllable Visual Stimuli Generation based on Human Concept RepresentationsCode1
Large Language Models in the Clinic: A Comprehensive BenchmarkCode1
BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical AnalysisCode1
Conformal Predictive Systems Under Covariate ShiftCode1
Group-Aware Coordination Graph for Multi-Agent Reinforcement LearningCode1
Open-Ended Wargames with Large Language ModelsCode1
MCPNet: An Interpretable Classifier via Multi-Level Concept PrototypesCode1
Show:102550
← PrevPage 15 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified