SOTAVerified

Sequential Decision Making

Papers

Showing 125 of 1210 papers

TitleStatusHype
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Eureka: Human-Level Reward Design via Coding Large Language ModelsCode4
Reflexion: Language Agents with Verbal Reinforcement LearningCode4
MineStudio: A Streamlined Package for Minecraft AI Agent DevelopmentCode3
Reinforcement Learning Meets Visual OdometryCode3
Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsCode2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPOCode2
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency TradingCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
Jack of All Trades, Master of Some, a Multi-Purpose Transformer AgentCode2
STEVE-1: A Generative Model for Text-to-Behavior in MinecraftCode2
Trieste: Efficiently Exploring The Depths of Black-box Functions with TensorFlowCode2
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-DependencyCode2
Dungeons and Data: A Large-Scale NetHack DatasetCode2
Multi-Agent Reinforcement Learning is a Sequence Modeling ProblemCode2
Pre-Trained Language Models for Interactive Decision-MakingCode2
Large Language Models for Planning: A Comprehensive and Systematic SurveyCode1
LLINBO: Trustworthy LLM-in-the-Loop Bayesian OptimizationCode1
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit TasksCode1
On Generalization Across Environments In Multi-Objective Reinforcement LearningCode1
Reinforcement learning with combinatorial actions for coupled restless banditsCode1
Training a Generally Curious AgentCode1
Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning PoliciesCode1
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban SimulationCode1
DataEnvGym: Data Generation Agents in Teacher Environments with Student FeedbackCode1
Show:102550
← PrevPage 1 of 49Next →

No leaderboard results yet.