SOTAVerified

Sequential Decision Making

Papers

Showing 326–350 of 1210 papers

| Title | Status | Hype |
|---|---|---|
| Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients | Code | 0 |
| ARDuP: Active Region Video Diffusion for Universal Policies | | 0 |
| Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond | | 0 |
| Model Adaptation for Time Constrained Embodied Control | | 0 |
| Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | | 0 |
| Efficient Sequential Decision Making with Large Language Models | | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | | 0 |
| Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability Objectives | Code | 0 |
| "Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations | Code | 0 |
| Rectifying Reinforcement Learning for Reward Matching | | 0 |
| Combining Experimental and Historical Data for Policy Evaluation | Code | 0 |
| Reward Machines for Deep RL in Noisy and Uncertain Environments | Code | 0 |
| Low-rank finetuning for LLMs: A fairness perspective | | 0 |
| Leveraging Offline Data in Linear Latent Bandits | | 0 |
| OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | | 0 |
| Variational Offline Multi-agent Skill Discovery | | 0 |
| Inference of Utilities and Time Preference in Sequential Decision-Making | | 0 |
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | | 0 |
| A finite time analysis of distributed Q-learning | | 0 |
| Efficiently Training Deep-Learning Parametric Policies using Lagrangian Duality | | 0 |
| Understanding the Training and Generalization of Pretrained Transformer for Sequential Decision Making | | 0 |
| Reinforcing Language Agents via Policy Optimization with Action Decomposition | | 0 |
| On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models | | 0 |
| FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits | Code | 0 |
| Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search | Code | 0 |

No leaderboard results yet.