SOTAVerified

Sequential Decision Making

Papers

Showing 76100 of 1210 papers

TitleStatusHype
Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control ProblemCode1
Mixed Policy Gradient: off-policy reinforcement learning driven jointly by data and modelCode1
An empirical evaluation of active inference in multi-armed banditsCode1
TimeSHAP: Explaining Recurrent Models through Sequence PerturbationsCode1
Adaptive Stress Testing of Trajectory Predictions in Flight Management SystemsCode1
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and BaselinesCode1
Multi-task Causal Learning with Gaussian ProcessesCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Occupancy Anticipation for Efficient Exploration and NavigationCode1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RLCode1
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?Code1
Learning Dynamic Belief Graphs to Generalize on Text-Based GamesCode1
PDDLGym: Gym Environments from PDDL ProblemsCode1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted PrescriptionCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction GuaranteesCode1
Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions ModelingCode1
Learning Multi-Level Hierarchies with HindsightCode1
SkipNet: Learning Dynamic Routing in Convolutional NetworksCode1
Thinking Fast and Slow with Deep Learning and Tree SearchCode1
An Alternative Softmax Operator for Reinforcement LearningCode1
AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air0
Show:102550
← PrevPage 4 of 49Next →

No leaderboard results yet.