SOTAVerified

Decision Making

Papers

Showing 126150 of 12311 papers

TitleStatusHype
Contextual Experience Replay for Self-Improvement of Language Agents0
QuantMCP: Grounding Large Language Models in Verifiable Financial Reality0
Prompting Wireless Networks: Reinforced In-Context Learning for Power Control0
Object Navigation with Structure-Semantic Reasoning-Based Multi-level Map and Multimodal Decision-Making LLM0
SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction0
Structured Labeling Enables Faster Vision-Language Models for End-to-End Autonomous Driving0
Natural Language Interaction with Databases on Edge Devices in the Internet of Battlefield Things0
Empowering Economic Simulation for Massively Multiplayer Online Games through Generative Agent-Based Modeling0
Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation0
Ignoring Directionality Leads to Compromised Graph Neural Network Explanations0
Artificial Intelligence Should Genuinely Support Clinical Reasoning and Decision Making To Bridge the Translational Gap0
Impact of Hill coefficient and time delay on a perceptual decision-making model0
AD-EE: Early Exiting for Fast and Reliable Vision-Language Models in Autonomous Driving0
Conformal Mixed-Integer Constraint Learning with Feasibility Guarantees0
CLAIM: An Intent-Driven Multi-Agent Framework for Analyzing Manipulation in Courtroom DialoguesCode0
An AI-Based Public Health Data Monitoring System0
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data SynthesisCode1
Finding signatures of low-dimensional geometric landscapes in high-dimensional cell fate transitionsCode0
FPGA-Enabled Machine Learning Applications in Earth Observation: A Systematic ReviewCode0
TextAtari: 100K Frames Game Playing with Language AgentsCode0
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments0
Joint Modeling for Learning Decision-Making Dynamics in Behavioral Experiments0
A Smart Multimodal Healthcare Copilot with Powerful LLM ReasoningCode3
Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons0
Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff0
Show:102550
← PrevPage 6 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified