| PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language Models | Feb 2, 2024 | Action Generation, Decision Making | Code Available | 3 |
| Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing | May 5, 2025 | In-Context Reinforcement Learning, RAG | Code Available | 1 |
| Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning | Feb 26, 2025 | In-Context Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Vintix: Action Model via In-Context Reinforcement Learning | Jan 31, 2025 | Decision Making, In-Context Reinforcement Learning | Code Available | 1 |
| ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Oct 3, 2024 | Few-Shot Imitation Learning, Imitation Learning | Code Available | 1 |
| In-Context Reinforcement Learning for Variable Action Spaces | Dec 20, 2023 | In-Context Reinforcement Learning, Multi-Armed Bandits | Code Available | 1 |
| Emergence of In-Context Reinforcement Learning from Noise Distillation | Dec 19, 2023 | In-Context Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents | Oct 15, 2023 | In-Context Learning, In-Context Reinforcement Learning | Code Available | 1 |
| Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining | Oct 12, 2023 | In-Context Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Structured State Space Models for In-Context Reinforcement Learning | Mar 7, 2023 | continuous-control, Continuous Control | Code Available | 1 |