SOTAVerified

Decision Making

Papers

Showing 6901-6950 of 12311 papers

| Title | Status | Hype |
| --- | --- | --- |
| Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes | | 0 |
| Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration | | 0 |
| Occupancy Grids: A Stochastic Spatial Representation for Active Robot Perception | | 0 |
| OCMDP: Observation-Constrained Markov Decision Process | | 0 |
| OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System | | 0 |
| OFA^2: A Multi-Objective Perspective for the Once-for-All Neural Architecture Search | | 0 |
| Off-line approximate dynamic programming for the vehicle routing problem with a highly variable customer basis and stochastic demands | | 0 |
| Offline Hierarchical Reinforcement Learning via Inverse Optimization | | 0 |
| Offline Imitation Learning with a Misspecified Simulator | | 0 |
| Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion | | 0 |
| Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare | | 0 |
| Offline Learning for Combinatorial Multi-armed Bandits | | 0 |
| Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information | | 0 |
| Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation | | 0 |
| Offline Reinforcement Learning Hands-On | | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | | 0 |
| Offline Risk-sensitive RL with Partial Observability to Enhance Performance in Human-Robot Teaming | | 0 |
| Off-Policy Evaluation and Counterfactual Methods in Dynamic Auction Environments | | 0 |
| Off-policy Evaluation for Payments at Adyen | | 0 |
| Off-Policy Evaluation for Sequential Persuasion Process with Unobserved Confounding | | 0 |
| Off-Policy Evaluation with Policy-Dependent Optimization Response | | 0 |
| Off-Policy Interval Estimation with Lipschitz Value Iteration | | 0 |
| Of Mice and Machines: A Comparison of Learning Between Real World Mice and RL Agents | | 0 |
| Of Quantiles and Expectiles: Consistent Scoring Functions, Choquet Representations, and Forecast Rankings | | 0 |
| OMBA: User-Guided Product Representations for Online Market Basket Analysis | | 0 |
| OMENN: One Matrix to Explain Neural Networks | | 0 |
| OMGPT: A Sequence Modeling Framework for Data-driven Operational Decision Making | | 0 |
| Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation | | 0 |
| OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning | | 0 |
| On adaptivity and minimax optimality of two-sided nearest neighbors | | 0 |
| On Adversarial Examples and Stealth Attacks in Artificial Intelligence Systems | | 0 |
| On a gap between rational annuitization price for producer and price for customer | | 0 |
| On Algorithmic Decision Procedures in Emergency Response Systems in Smart and Connected Communities | | 0 |
| On a notion of independence proposed by Teddy Seidenfeld | | 0 |
| On anthropomorphic decision making in a model observer | | 0 |
| On Applications of Bootstrap in Continuous Space Reinforcement Learning | | 0 |
| On Appropriate Selection of Fuzzy Aggregation Operators in Medical Decision Support System | | 0 |
| On Attribution of Recurrent Neural Network Predictions via Additive Decomposition | | 0 |
| On Avoiding Power-Seeking by Artificial Intelligence | | 0 |
| On Bellman's Optimality Principle for zs-POSGs | | 0 |
| On Blame Attribution for Accountable Multi-Agent Sequential Decision Making | | 0 |
| Onboard Optimization and Learning: A Survey | | 0 |
| On Calibration in Multi-Distribution Learning | | 0 |
| On Calibration of Speech Classification Models: Insights from Energy-Based Model Investigations | | 0 |
| On Causally Disentangled State Representation Learning for Reinforcement Learning based Recommender Systems | | 0 |
| Onchain Sports Betting using UBET Automated Market Maker | | 0 |
| On Combining Machine Learning with Decision Making | | 0 |
| On Composable and Parametric Uncertainty in Systems Co-Design | | 0 |
| On Computation and Generalization of Generative Adversarial Imitation Learning | | 0 |
| On Consequentialism and Fairness | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |