| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action Generation, Decision Making | Code Available | 3 |
| AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents | Oct 15, 2023 | In-Context Learning, In-Context Reinforcement Learning | Code Available | 1 |
| Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning | Feb 26, 2025 | In-Context Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Emergence of In-Context Reinforcement Learning from Noise Distillation | Dec 19, 2023 | In-Context Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| In-Context Reinforcement Learning for Variable Action Spaces | Dec 20, 2023 | In-Context Reinforcement Learning, Multi-Armed Bandits | Code Available | 1 |
| In-context Reinforcement Learning with Algorithm Distillation | Oct 25, 2022 | In-Context Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing | May 5, 2025 | In-Context Reinforcement Learning, RAG | Code Available | 1 |
| ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Oct 3, 2024 | Few-Shot Imitation Learning, Imitation Learning | Code Available | 1 |
| Structured State Space Models for In-Context Reinforcement Learning | Mar 7, 2023 | continuous-control, Continuous Control | Code Available | 1 |
| Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining | Oct 12, 2023 | In-Context Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Vintix: Action Model via In-Context Reinforcement Learning | Jan 31, 2025 | Decision Making, In-Context Reinforcement Learning | Code Available | 1 |
| Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning | Apr 9, 2024 | Combinatorial Optimization, In-Context Reinforcement Learning | —Unverified | 0 |
| Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs | Oct 2, 2024 | In-Context Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Retrieval-Augmented Hierarchical in-Context Reinforcement Learning and Hindsight Modular Reflections for Task Planning with LLMs | Aug 12, 2024 | Decision Making, Hierarchical Reinforcement Learning | —Unverified | 0 |
| Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack | Oct 9, 2024 | In-Context Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| HVAC-DPT: A Decision Pretrained Transformer for HVAC Control | Nov 29, 2024 | In-Context Reinforcement Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Yes, Q-learning Helps Offline In-Context RL | Feb 24, 2025 | In-Context Reinforcement Learning, MuJoCo | —Unverified | 0 |
| Supervised Pretraining Can Learn In-Context Reinforcement Learning | Jun 26, 2023 | Decision Making, In-Context Learning | —Unverified | 0 |
| Filtering Learning Histories Enhances In-Context Reinforcement Learning | May 21, 2025 | In-Context Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| From Memories to Maps: Mechanisms of In-Context Reinforcement Learning in Transformers | Jun 24, 2025 | In-Context Learning, In-Context Reinforcement Learning | —Unverified | 0 |
| GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration | Jan 23, 2025 | In-Context Reinforcement Learning | —Unverified | 0 |
| LLMs Are In-Context Bandit Reinforcement Learners | Oct 7, 2024 | In-Context Learning, In-Context Reinforcement Learning | —Unverified | 0 |
| OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds | Feb 5, 2025 | Few-Shot Learning, Imitation Learning | —Unverified | 0 |
| Transformers Learn Temporal Difference Methods for In-Context Reinforcement Learning | May 22, 2024 | In-Context Learning, In-Context Reinforcement Learning | —Unverified | 0 |
| RL + Transformer = A General-Purpose Problem Solver | Jan 24, 2025 | In-Context Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| A Survey of In-Context Reinforcement Learning | Feb 11, 2025 | In-Context Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents | Jul 2, 2024 | Decision Making, In-Context Reinforcement Learning | —Unverified | 0 |
| Can In-Context Reinforcement Learning Recover From Reward Poisoning Attacks? | Jun 7, 2025 | In-Context Reinforcement Learning | —Unverified | 0 |
| Random Policy Enables In-Context Reinforcement Learning within Trust Horizons | Oct 25, 2024 | In-Context Learning, In-Context Reinforcement Learning | —Unverified | 0 |
| Scaling Algorithm Distillation for Continuous Control with Mamba | Jun 16, 2025 | continuous-control, Continuous Control | —Unverified | 0 |
| XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Jun 13, 2024 | GPU, In-Context Learning | Code Available | 0 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement Learning, Sequential Decision Making | Code Available | 0 |
| Free Random Projection for In-Context Reinforcement Learning | Apr 9, 2025 | In-Context Reinforcement Learning, reinforcement-learning | Code Available | 0 |