SOTAVerified

Sequential Decision Making

Papers

Showing 331340 of 1210 papers

TitleStatusHype
Efficient Sequential Decision Making with Large Language Models0
Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits0
"Give Me an Example Like This": Episodic Active Reinforcement Learning from DemonstrationsCode0
Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability ObjectivesCode0
Rectifying Reinforcement Learning for Reward Matching0
Combining Experimental and Historical Data for Policy EvaluationCode0
Reward Machines for Deep RL in Noisy and Uncertain EnvironmentsCode0
Low-rank finetuning for LLMs: A fairness perspective0
Leveraging Offline Data in Linear Latent Bandits0
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators0
Show:102550
← PrevPage 34 of 121Next →

No leaderboard results yet.