SOTAVerified

Sequential Decision Making

Papers

Showing 150 of 1210 papers

TitleStatusHype
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Reflexion: Language Agents with Verbal Reinforcement LearningCode4
Eureka: Human-Level Reward Design via Coding Large Language ModelsCode4
Reinforcement Learning Meets Visual OdometryCode3
MineStudio: A Streamlined Package for Minecraft AI Agent DevelopmentCode3
Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlowCode2
Jack of All Trades, Master of Some, a Multi-Purpose Transformer AgentCode2
STEVE-1: A Generative Model for Text-to-Behavior in MinecraftCode2
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsCode2
Dungeons and Data: A Large-Scale NetHack DatasetCode2
Multi-Agent Reinforcement Learning is a Sequence Modeling ProblemCode2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPOCode2
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency TradingCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Pre-Trained Language Models for Interactive Decision-MakingCode2
Learning Dynamic Belief Graphs to Generalize on Text-Based GamesCode1
Large Language Models for Planning: A Comprehensive and Systematic SurveyCode1
Learning Multi-Level Hierarchies with HindsightCode1
IQ-Learn: Inverse soft-Q Learning for ImitationCode1
Adaptive Stress Testing of Trajectory Predictions in Flight Management SystemsCode1
How Can LLM Guide RL? A Value-Based ApproachCode1
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy OptimizationCode1
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised LearningCode1
Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand SystemsCode1
Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control ProblemCode1
LLF-Bench: Benchmark for Interactive Learning from Language FeedbackCode1
Large Language Model as a Policy Teacher for Training Reinforcement Learning AgentsCode1
Layered and Staged Monte Carlo Tree Search for SMT Strategy SynthesisCode1
Learning Discrete World Models for Heuristic SearchCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted PrescriptionCode1
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State SpacesCode1
DataEnvGym: Data Generation Agents in Teacher Environments with Student FeedbackCode1
Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions ModelingCode1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step TreesCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Counterfactual Explanations in Sequential Decision Making Under UncertaintyCode1
Curriculum-based Reinforcement Learning for Distribution System Critical Load RestorationCode1
AdaPlanner: Adaptive Planning from Feedback with Language ModelsCode1
Decision Stacks: Flexible Reinforcement Learning via Modular Generative ModelsCode1
Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply ChainsCode1
Deep Reinforcement Learning for Entity AlignmentCode1
Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning PoliciesCode1
Dynamic Causal Bayesian OptimizationCode1
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?Code1
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning ApproachCode1
Extracting Reward Functions from Diffusion ModelsCode1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI GymCode1
Show:102550
← PrevPage 1 of 25Next →

No leaderboard results yet.