SOTAVerified

Decision Making

Papers

Showing 13511375 of 12311 papers

TitleStatusHype
ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding ValidationCode0
Identification of Probabilities of Causation: A Complete Characterization0
Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories0
Finding separatrices of dynamical flows with Deep Koopman Eigenfunctions0
ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection0
Cost-Augmented Monte Carlo Tree Search for LLM-Assisted Planning0
Enhancing Robot Navigation Policies with Task-Specific Uncertainty Managements0
Choosing a Model, Shaping a Future: Comparing LLM Perspectives on Sustainability and its Relationship with AI0
The Evolution of Alpha in Finance Harnessing Human Insight and LLM Agents0
RAG/LLM Augmented Switching Driven Polymorphic Metaheuristic Framework0
Interpretable Reinforcement Learning for Load Balancing using Kolmogorov-Arnold Networks0
Hypothesis on the Functional Advantages of the Selection-Broadcast Cycle Structure: Global Workspace Theory and Dealing with a Real-Time World0
Energy-Efficient Deep Reinforcement Learning with Spiking Transformers0
Interpretable Dual-Stream Learning for Local Wind Hazard Prediction in Vulnerable Communities0
Structured Agent Distillation for Large Language Model0
BACON: A fully explainable AI model with graded logic for decision making problems0
APEX: Empowering LLMs with Physics-Based Task Planning for Real-time InsightCode0
CSAGC-IDS: A Dual-Module Deep Learning Network Intrusion Detection Model for Complex and Imbalanced Data0
Dynamic Decision-Making under Model Misspecification0
Bellman operator convergence enhancements in reinforcement learning algorithms0
When Bias Backfires: The Modulatory Role of Counterfactual Explanations on the Adoption of Algorithmic Bias in XAI-Supported Human Decision-MakingCode0
High-dimensional Nonparametric Contextual Bandit Problem0
Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities0
SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models0
Low-regret Strategies for Energy Systems Planning in a Highly Uncertain Future0
Show:102550
← PrevPage 55 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified