SOTAVerified

Decision Making

Papers

Showing 1000110025 of 12311 papers

TitleStatusHype
LLM Reasoner and Automated Planner: A new NPC approach0
LLMs and the Human Condition0
LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks0
LLMs for clinical risk prediction0
LLMs for Generalizable Language-Conditioned Policy Learning under Minimal Data Requirements0
LLMs for Relational Reasoning: How Far are We?0
LLMs for Robotic Object Disambiguation0
LLMs meet Federated Learning for Scalable and Secure IoT Management0
LLM-Stackelberg Games: Conjectural Reasoning Equilibria and Their Applications to Spearphishing0
LLM-State: Open World State Representation for Long-horizon Task Planning with Large Language Model0
LLMs Working in Harmony: A Survey on the Technological Aspects of Building Effective LLM-Based Multi Agent Systems0
LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets0
LLT-PolyU: Identifying Sentiment Intensity in Ironic Tweets0
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations0
LMExplainer: Grounding Knowledge and Explaining Language Models0
Local and global model interpretability via backward selection and clustering0
Local Differential Privacy for Sequential Decision Making in a Changing Environment0
Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning0
Local Interpretation Methods to Machine Learning Using the Domain of the Feature Space0
Localization under Topological Uncertainty for Lane Identification of Autonomous Vehicles0
Local Calibration: Metrics and Recalibration0
Localized Flood DetectionWith Minimal Labeled Social Media Data Using Transfer Learning0
Local Justice and the Algorithmic Allocation of Societal Resources0
Optimal Local Explainer Aggregation for Interpretable Prediction0
Worst-Case Optimal Multi-Armed Gaussian Best Arm Identification with a Fixed Budget0
Show:102550
← PrevPage 401 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified