SOTAVerified

In-Context Reinforcement Learning

Papers

Showing 125 of 33 papers

TitleStatusHype
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language ModelsCode3
Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-PracticingCode1
Distilling Reinforcement Learning Algorithms for In-Context Model-Based PlanningCode1
Vintix: Action Model via In-Context Reinforcement LearningCode1
ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AICode1
In-Context Reinforcement Learning for Variable Action SpacesCode1
Emergence of In-Context Reinforcement Learning from Noise DistillationCode1
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive AgentsCode1
Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised PretrainingCode1
Structured State Space Models for In-Context Reinforcement LearningCode1
In-context Reinforcement Learning with Algorithm DistillationCode1
From Memories to Maps: Mechanisms of In-Context Reinforcement Learning in Transformers0
Scaling Algorithm Distillation for Continuous Control with Mamba0
Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks?0
Filtering Learning Histories Enhances In-Context Reinforcement Learning0
Free Random Projection for In-Context Reinforcement LearningCode0
Yes, Q-learning Helps Offline In-Context RL0
A Survey of In-Context Reinforcement Learning0
OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds0
An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online AdvertisingCode0
RL + Transformer = A General-Purpose Problem Solver0
GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration0
HVAC-DPT: A Decision Pretrained Transformer for HVAC Control0
Random Policy Enables In-Context Reinforcement Learning within Trust Horizons0
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.