SOTAVerified

Decision Making

Papers

Showing 1701–1750 of 12311 papers

Title | Status | Hype
Calibrated Decision-Making through LLM-Assisted Retrieval | | 0
Do LLM Personas Dream of Bull Markets? Comparing Human and AI Investment Strategies Through the Lens of the Five-Factor Model | | 0
Project MPG: towards a generalized performance benchmark for LLM capabilities | | 0
Moral Agency in Silico: Exploring Free Will in Large Language Models | | 0
Sabotage Evaluations for Frontier Models | | 0
Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games | | 0
Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments | | 0
Bayesian Regression for Predicting Subscription to Bank Term Deposits in Direct Marketing Campaigns | | 0
Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | | 0
Bridging the Gap between Expert and Language Models: Concept-guided Chess Commentary Generation and Evaluation | | 0
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering | | 0
Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments | Code | 0
The green transition of firms: The role of evolutionary competition, adjustment costs, transition risk, and green technology progress | | 0
Voting with Random Proposers: Two Rounds Suffice | | 0
Deconfounding Time Series Forecasting | | 0
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | | 0
Toward Conditional Distribution Calibration in Survival Prediction | Code | 1
Language Models And A Second Opinion Use Case: The Pocket Professional | | 0
Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations | Code | 0
Predicting Mortality and Functional Status Scores of Traumatic Brain Injury Patients using Supervised Machine Learning | | 0
Improving Decision Sparsity | Code | 0
Quantifying Risk Propensities of Large Language Models: Ethical Focus and Bias Detection through Role-Play | | 0
KisanQRS: A Deep Learning-based Automated Query-Response System for Agricultural Decision-Making | | 0
Beyond Fine-Tuning: Effective Strategies for Mitigating Hallucinations in Large Language Models for Data Analytics | | 0
LLM-Consensus: Multi-Agent Debate for Visual Misinformation Detection | | 0
Causal Abstraction in Model Interpretability: A Compact Survey | | 0
Neural Fields in Robotics: A Survey | Code | 5
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting | | 0
Optimizing Hearthstone Agents using an Evolutionary Algorithm | Code | 0
Enhancing Exchange Rate Forecasting with Explainable Deep Learning Models | | 0
Planning-Aware Diffusion Networks for Enhanced Motion Forecasting in Autonomous Driving | | 0
Designing LLM-Agents with Personalities: A Psychometric Approach | | 0
AgentForge: A Flexible Low-Code Platform for Reinforcement Learning Agent Design | Code | 0
Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks | | 0
Context-Aware Trajectory Anomaly Detection | | 0
Zero-shot Object Navigation with Vision-Language Models Reasoning | | 0
Learning to Look: Seeking Information for Decision Making via Policy Factorization | | 0
Aligning CodeLLMs with Direct Preference Optimization | | 0
From Efficiency to Equity: Measuring Fairness in Preference Learning | | 0
Context is Key: A Benchmark for Forecasting with Essential Textual Information | Code | 2
Impact of uncertainties on the Stability Lobe Diagram for vibration evaluation in milling | | 0
Predicting Company Growth by Econophysics informed Machine Learning | | 0
Identifiable Representation and Model Learning for Latent Dynamic Systems | | 0
Exploiting Text-Image Latent Spaces for the Description of Visual Concepts | | 0
Lightweight Neural App Control | | 0
Learning Versatile Skills with Curriculum Masking | Code | 0
Integrating Large Language Models for UAV Control in Simulated Environments: A Modular Interaction Approach | | 0
The Hive Mind is a Single Reinforcement Learning Agent | | 0
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting | Code | 1
Applying Data Driven Decision Making to rank Vocational and Educational Training Programs with TOPSIS | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified