SOTAVerified

Decision Making

Papers

Showing 801–850 of 12311 papers

Title | Status | Hype
Iterative Value Function Optimization for Guided Decoding | - | 0
Generator-Assistant Stepwise Rollback Framework for Large Language Model Agent | Code | 0
Seeing Stereotypes | - | 0
Unlocking a New Rust Programming Experience: Fast and Slow Thinking with LLMs to Conquer Undefined Behaviors | - | 0
A Game-Theoretic Approach for High-Resolution Automotive FMCW Radar Interference Avoidance | - | 0
Towards Robust Multi-UAV Collaboration: MARL with Noise-Resilient Communication and Attention Mechanisms | Code | 0
An energy-efficient learning solution for the Agile Earth Observation Satellite Scheduling Problem | - | 0
Medical Support System for Spontaneous Breathing Trial Prediction Using Nonuniform Discrete Fourier Transform | - | 0
How Do Consumers Really Choose: Exposing Hidden Preferences with the Mixture of Experts Model | - | 0
Survey Perspective: The Role of Explainable AI in Threat Intelligence | - | 0
Proportionality in Thumbs Up and Down Voting | - | 0
Synthetic Tabular Data Detection In the Wild | - | 0
Improving Retrospective Language Agents via Joint Policy Gradient Optimization | - | 0
Perceptual Motor Learning with Active Inference Framework for Robust Lateral Control | - | 0
Learning to Generate Long-term Future Narrations Describing Activities of Daily Living | - | 0
Architectural and Inferential Inductive Biases For Exchangeable Sequence Modeling | Code | 0
PA-CLIP: Enhancing Zero-Shot Anomaly Detection through Pseudo-Anomaly Awareness | - | 0
Building Interval Type-2 Fuzzy Membership Function: A Deck of Cards based Co-constructive Approach | - | 0
Multi-Agent Reinforcement Learning with Long-Term Performance Objectives for Service Workforce Optimization | - | 0
Can AI Model the Complexities of Human Moral Decision-Making? A Qualitative Study of Kidney Allocation Decisions | - | 0
Evidence of conceptual mastery in the application of rules by Large Language Models | - | 0
On Generalization Across Environments In Multi-Objective Reinforcement Learning | Code | 1
From Understanding the World to Intervening in It: A Unified Multi-Scale Framework for Embodied Cognition | Code | 0
Evolution of Information in Interactive Decision Making: A Case Study for Multi-Armed Bandits | - | 0
Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence | - | 0
Dissecting the Impact of Model Misspecification in Data-Driven Optimization | - | 0
Shaping Laser Pulses with Reinforcement Learning | - | 0
Semi-Parametric Batched Global Multi-Armed Bandits with Covariates | - | 0
Interacting with AI Reasoning Models: Harnessing "Thoughts" for AI-Driven Software Engineering | - | 0
Shifting Power: Leveraging LLMs to Simulate Human Aversion in ABMs of Bilateral Financial Exchanges, A bond market study | - | 0
What Makes a Good Diffusion Planner for Decision Making? | Code | 2
MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Code | 0
Ro-To-Go! Robust Reactive Control with Signal Temporal Logic | - | 0
Digital Player: Evaluating Large Language Models based Human-like Agent in Games | Code | 2
Adaptive Reinforcement Learning for State Avoidance in Discrete Event Systems | - | 0
Llamarine: Open-source Maritime Industry-specific Large Language Model | - | 0
Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations | - | 0
Investigating the Relationship Between Debiasing and Artifact Removal using Saliency Maps | - | 0
A Deep User Interface for Exploring LLaMa | - | 0
Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Code | 0
Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications | - | 0
Non-Cooperative Games with Uncertainty | - | 0
Efficient Risk-sensitive Planning via Entropic Risk Measures | - | 0
Large Language Model Strategic Reasoning Evaluation through Behavioral Game Theory | - | 0
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application | - | 0
Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights | - | 0
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer | Code | 1
Can a calibration metric be both testable and actionable? | Code | 0
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies | Code | 0
Program Synthesis Dialog Agents for Interactive Decision-Making | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified