SOTAVerified

Decision Making

Papers

Showing 1201-1225 of 12311 papers

| Title | Status | Hype |
|---|---|---|
| LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned Model | Code | 0 |
| Box-Constrained Softmax Function and Its Application for Post-Hoc Calibration | Code | 0 |
| Do Language Models Have Bayesian Brains? Distinguishing Stochastic and Deterministic Decision Patterns within Large Language Models | | 0 |
| Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges | Code | 0 |
| The Gittins Index: A Design Principle for Decision-Making Under Uncertainty | | 0 |
| Agentic Semantic Control for Autonomous Wireless Space Networks: Extending Space-O-RAN with MCP-Driven Distributed Intelligence | | 0 |
| Towards Responsible AI: Advances in Safety, Fairness, and Accountability of Autonomous Systems | | 0 |
| Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models | | 0 |
| Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling | | 0 |
| Beyond Nash Equilibrium: Bounded Rationality of LLMs and humans in Strategic Decision-making | | 0 |
| The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability | | 0 |
| FinHEAR: Human Expertise and Adaptive Risk-Aware Temporal Reasoning for Financial Decision-Making | | 0 |
| Did I Faithfully Say What I Thought? Bridging the Gap Between Neural Activity and Self-Explanations in Large Language Models | | 0 |
| Spiking Neural Models for Decision-Making Tasks with Learning | Code | 0 |
| Real-Time Cascade Mitigation in Power Systems Using Influence Graph Improved by Reinforcement Learning | | 0 |
| Bayesian Inverse Physics for Neuro-Symbolic Robot Learning | | 0 |
| How to Provably Improve Return Conditioned Supervised Learning? | | 0 |
| Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data | | 0 |
| Understanding Software Engineering Agents Through the Lens of Traceability: An Empirical Study | | 0 |
| Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents | | 0 |
| HGFormer: A Hierarchical Graph Transformer Framework for Two-Stage Colonel Blotto Games via Reinforcement Learning | | 0 |
| LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement | | 0 |
| Improving Fairness of Large Language Models in Multi-document Summarization | Code | 0 |
| A Unified Anti-Jamming Design in Complex Environments Based on Cross-Modal Fusion and Intelligent Decision-Making | | 0 |
| REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models | | 0 |
Page 49 of 493

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |