| RL + Transformer = A General-Purpose Problem Solver | Jan 24, 2025 | In-Context Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration | Jan 23, 2025 | In-Context Reinforcement Learning | Unverified | 0 |
| HVAC-DPT: A Decision Pretrained Transformer for HVAC Control | Nov 29, 2024 | In-Context Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Random Policy Enables In-Context Reinforcement Learning within Trust Horizons | Oct 25, 2024 | In-Context Learning, In-Context Reinforcement Learning | Unverified | 0 |
| Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack | Oct 9, 2024 | In-Context Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| LLMs Are In-Context Bandit Reinforcement Learners | Oct 7, 2024 | In-Context Learning, In-Context Reinforcement Learning | Unverified | 0 |
| Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs | Oct 2, 2024 | In-Context Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Retrieval-Augmented Hierarchical in-Context Reinforcement Learning and Hindsight Modular Reflections for Task Planning with LLMs | Aug 12, 2024 | Decision Making, Hierarchical Reinforcement Learning | Unverified | 0 |
| Beyond Numeric Awards: In-Context Dueling Bandits with LLM Agents | Jul 2, 2024 | Decision Making, In-Context Reinforcement Learning | Unverified | 0 |
| XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Jun 13, 2024 | GPU, In-Context Learning | Code Available | 0 |