SOTAVerified

Sequential Decision Making

Papers

Showing 125 of 1210 papers

TitleStatusHype
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Eureka: Human-Level Reward Design via Coding Large Language ModelsCode4
Reflexion: Language Agents with Verbal Reinforcement LearningCode4
MineStudio: A Streamlined Package for Minecraft AI Agent DevelopmentCode3
Reinforcement Learning Meets Visual OdometryCode3
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency TradingCode2
STEVE-1: A Generative Model for Text-to-Behavior in MinecraftCode2
Pre-Trained Language Models for Interactive Decision-MakingCode2
Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlowCode2
Multi-Agent Reinforcement Learning is a Sequence Modeling ProblemCode2
Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsCode2
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
Dungeons and Data: A Large-Scale NetHack DatasetCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Jack of All Trades, Master of Some, a Multi-Purpose Transformer AgentCode2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPOCode2
Decision Stacks: Flexible Reinforcement Learning via Modular Generative ModelsCode1
Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions ModelingCode1
DataEnvGym: Data Generation Agents in Teacher Environments with Student FeedbackCode1
Curriculum-based Reinforcement Learning for Distribution System Critical Load RestorationCode1
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State SpacesCode1
Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply ChainsCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
Counterfactual Explanations in Sequential Decision Making Under UncertaintyCode1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI GymCode1
Show:102550
← PrevPage 1 of 49Next →

No leaderboard results yet.