SOTAVerified

Decision Making

Papers

Showing 28012850 of 12311 papers

TitleStatusHype
Low-rank finetuning for LLMs: A fairness perspective0
Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous DrivingCode2
MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning0
Towards Dialogues for Joint Human-AI Reasoning and Value Alignment0
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation TreeCode0
LLM experiments with simulation: Large Language Model Multi-Agent System for Simulation Model Parametrization in Digital TwinsCode1
Can Automatic Metrics Assess High-Quality Translations?0
Can We Trust Embodied Agents? Exploring Backdoor Attacks against Embodied LLM-based Decision-Making Systems0
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators0
The Economic Implications of Large Language Model Selection on Earnings and Return on Investment: A Decision Theoretic Model0
Ontology-Enhanced Decision-Making for Autonomous Agents in Dynamic and Partially Observable Environments0
Collage is the New Writing: Exploring the Fragmentation of Text and User Interfaces in AI Tools0
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence0
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation0
Exploring and steering the moral compass of Large Language ModelsCode0
Benchmarking General-Purpose In-Context Learning0
Leveraging Offline Data in Linear Latent Bandits0
CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal ControlCode0
GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement LearningCode1
Position: Foundation Agents as the Paradigm Shift for Decision MakingCode2
Rethinking Transformers in Solving POMDPsCode1
Make Safe Decisions in Power System: Safe Reinforcement Learning Based Pre-decision Making for Voltage Stability Emergency Control0
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching RegularizationCode0
Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models0
AnyCBMs: How to Turn Any Black Box into a Concept Bottleneck Model0
Variational Offline Multi-agent Skill Discovery0
On Bits and Bandits: Quantifying the Regret-Information Trade-offCode0
An Evolutionary Framework for Connect-4 as Test-Bed for Comparison of Advanced Minimax, Q-Learning and MCTS0
SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation0
Detection of decision-making manipulation in the pairwise comparisons method0
Improving Health Professionals' Onboarding with AI and XAI for Trustworthy Human-AI Collaborative Decision Making0
Causal Concept Graph Models: Beyond Causal Opacity in Deep LearningCode0
A Prudent Framework for Understanding Risk-Awareness in Demand Response0
Acquiring Better Load Estimates by Combining Anomaly and Change Point Detection in Power Grid Time-series MeasurementsCode0
Front-propagation Algorithm: Explainable AI Technique for Extracting Linear Function Approximations from Neural Networks0
Federated Learning for Non-factorizable Models using Deep Generative Prior ApproximationsCode0
STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-MakingCode1
Generation of synthetic data using breast cancer dataset and classification with resnet180
Quantifying the Cross-sectoral Intersecting Discrepancies within Multiple Groups Using Latent Class Analysis Towards Fairness0
Serving economic prosperity: economic impact assessments (EIA) on Earth observation-based services and tools by SERVIR0
Inference of Utilities and Time Preference in Sequential Decision-Making0
Concept-based Explainable Malignancy Scoring on Pulmonary Nodules in CT Images0
Federated Offline Policy Optimization with Dual Regularization0
A Neurosymbolic Framework for Bias Correction in Convolutional Neural Networks0
Federated Behavioural Planes: Explaining the Evolution of Client Behaviour in Federated LearningCode0
A Trajectory-Based Bayesian Approach to Multi-Objective Hyperparameter Optimization with Epoch-Aware Trade-Offs0
iVideoGPT: Interactive VideoGPTs are Scalable World ModelsCode2
Diffusion Actor-Critic with Entropy RegulatorCode2
Learning Invariant Causal Mechanism from Vision-Language Models0
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning RateCode1
Show:102550
← PrevPage 57 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified