SOTAVerified

Decision Making

Papers

Showing 921-930 of 12311 papers

| Title | Status | Hype |
| --- | --- | --- |
| Human-Artificial Interaction in the Age of Agentic AI: A System-Theoretical Approach | | 0 |
| Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural Networks | Code | 0 |
| Benchmarking LLMs for Political Science: A United Nations Perspective | Code | 1 |
| LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | | 0 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | | 0 |
| MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering | | 0 |
| Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region | | 0 |
| RobustX: Robust Counterfactual Explanations Made Easy | Code | 1 |
| AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain Recommendations | Code | 0 |
| Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AI | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |