SOTAVerified

Decision Making

Papers

Showing 69016925 of 12311 papers

TitleStatusHype
Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning0
Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment0
Policy Gradient With Serial Markov Chain Reasoning0
Policy Gradient With Value Function Approximation For Collective Multiagent Planning0
Policy-labeled Preference Learning: Is Preference Enough for RLHF?0
Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System0
Policy Learning with a Natural Language Action Space: A Causal Approach0
Policy Learning with Asymmetric Counterfactual Utilities0
Policy Optimization Using Semi-parametric Models for Dynamic Pricing0
Policy Optimization with Model-based Explorations0
Policy Regularization for Legible Behavior0
Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning0
Polynomial Regret Concentration of UCB for Non-Deterministic State Transitions0
POMDPs in Continuous Time and Discrete Spaces0
POPPINS : A Population-Based Digital Spiking Neuromorphic Processor with Integer Quadratic Integrate-and-Fire Neurons0
Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning0
Population stratification enables modeling effects of reopening policies on mortality and hospitalization rates0
Portfolio optimization with two coherent risk measures0
Portfolio Selection via Topological Data Analysis0
Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video0
Position: Bayesian Statistics Facilitates Stakeholder Participation in Evaluation of Generative AI0
Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigms0
Position: Empowering Time Series Reasoning with Multimodal LLMs0
Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability0
Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs0
Show:102550
← PrevPage 277 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified