SOTAVerified

Decision Making

Papers

Showing 28012825 of 12311 papers

TitleStatusHype
Understanding Intrinsic Socioeconomic Biases in Large Language Models0
Towards Dialogues for Joint Human-AI Reasoning and Value Alignment0
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous DrivingCode2
MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning0
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation TreeCode0
Can Automatic Metrics Assess High-Quality Translations?0
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital TwinsCode1
Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-based Decision-Making Systems0
The Economic Implications of Large Language Model Selection on Earnings and Return on Investment: A Decision Theoretic Model0
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators0
Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments0
Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools0
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence0
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation0
Benchmarking General-Purpose In-Context Learning0
Position: Foundation Agents as the Paradigm Shift for Decision MakingCode2
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement LearningCode1
Rethinking Transformers in Solving POMDPsCode1
Leveraging Offline Data in Linear Latent Bandits0
Exploring and steering the moral compass of Large Language ModelsCode0
CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal ControlCode0
Make Safe Decisions in Power System: Safe Reinforcement Learning Based Pre-decision Making for Voltage Stability Emergency Control0
Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models0
Variational Offline Multi-agent Skill Discovery0
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching RegularizationCode0
Show:102550
← PrevPage 113 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified