SOTAVerified

Decision Making

Papers

Showing 6901-6950 of 12311 papers

| Title | Status | Hype |
| --- | --- | --- |
| Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning | | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | | 0 |
| Policy Gradient With Serial Markov Chain Reasoning | | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | | 0 |
| Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System | | 0 |
| Policy Learning with a Natural Language Action Space: A Causal Approach | | 0 |
| Policy Learning with Asymmetric Counterfactual Utilities | | 0 |
| Policy Optimization Using Semi-parametric Models for Dynamic Pricing | | 0 |
| Policy Optimization with Model-based Explorations | | 0 |
| Policy Regularization for Legible Behavior | | 0 |
| Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning | | 0 |
| Polynomial Regret Concentration of UCB for Non-Deterministic State Transitions | | 0 |
| POMDPs in Continuous Time and Discrete Spaces | | 0 |
| POPPINS : A Population-Based Digital Spiking Neuromorphic Processor with Integer Quadratic Integrate-and-Fire Neurons | | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | | 0 |
| Population stratification enables modeling effects of reopening policies on mortality and hospitalization rates | | 0 |
| Portfolio optimization with two coherent risk measures | | 0 |
| Portfolio Selection via Topological Data Analysis | | 0 |
| Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video | | 0 |
| Position: Bayesian Statistics Facilitates Stakeholder Participation in Evaluation of Generative AI | | 0 |
| Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigms | | 0 |
| Position: Empowering Time Series Reasoning with Multimodal LLMs | | 0 |
| Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability | | 0 |
| Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs | | 0 |
| Possibility neutrosophic soft sets with applications in decision making and similarity measure | | 0 |
| Post-COVID Inflation & the Monetary Policy Dilemma: An Agent-Based Scenario Analysis | | 0 |
| A Posteriori Probabilistic Bounds of Convex Scenario Programs with Validation Tests | | 0 |
| Posterior Sampling via Autoregressive Generation | | 0 |
| Posterior sampling with CNN-based, Plug-and-Play regularization with applications to Post-Stack Seismic Inversion | | 0 |
| Post-hoc Calibration of Neural Networks by g-Layers | | 0 |
| Post-Hoc Explainability of BI-RADS Descriptors in a Multi-task Framework for Breast Cancer Detection and Segmentation | | 0 |
| Post-hoc Interpretability Illumination for Scientific Interaction Discovery | | 0 |
| Posthoc Interpretability of Learning to Rank Models using Secondary Training Data | | 0 |
| Post-hoc loss-calibration for Bayesian neural networks | | 0 |
| Post-Radiotherapy PET Image Outcome Prediction by Deep Learning under Biological Model Guidance: A Feasibility Study of Oropharyngeal Cancer Application | | 0 |
| Potential-based Credit Assignment for Cooperative RL-based Testing of Autonomous Vehicles | | 0 |
| Potential Game-Based Decision-Making for Autonomous Driving | | 0 |
| Power and accountability in reinforcement learning applications to environmental policy | | 0 |
| Power and Accountability in RL-driven Environmental Policy | | 0 |
| Powerful A/B-Testing Metrics and Where to Find Them | | 0 |
| Power grid operational risk assessment using graph neural network surrogates | | 0 |
| Power-law Scaling to Assist with Key Challenges in Artificial Intelligence | | 0 |
| Power System Decarbonization: Impacts of Energy Storage Duration and Interannual Renewables Variability | | 0 |
| Power System Fault Diagnosis with Quantum Computing and Efficient Gate Decomposition | | 0 |
| Power to the teens? A model of parents' and teens' collective labor supply | | 0 |
| PowRL: A Reinforcement Learning Framework for Robust Management of Power Networks | | 0 |
| Pyramid Pixel Context Adaption Network for Medical Image Classification with Supervised Contrastive Learning | | 0 |
| Practical Algorithms for Multi-Stage Voting Rules with Parallel Universes Tiebreaking | | 0 |
| Practical Algorithms for STV and Ranked Pairs with Parallel Universes Tiebreaking | | 0 |
Page 139 of 247

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |