| Vintix: Action Model via In-Context Reinforcement Learning | Jan 31, 2025 | Decision MakingIn-Context Reinforcement Learning | CodeCode Available | 1 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement LearningSequential Decision Making | CodeCode Available | 0 |
| RL + Transformer = A General-Purpose Problem Solver | Jan 24, 2025 | In-Context Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration | Jan 23, 2025 | In-Context Reinforcement Learning | —Unverified | 0 |
| HVAC-DPT: A Decision Pretrained Transformer for HVAC Control | Nov 29, 2024 | In-Context Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Random Policy Enables In-Context Reinforcement Learning within Trust Horizons | Oct 25, 2024 | In-Context LearningIn-Context Reinforcement Learning | —Unverified | 0 |
| Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack | Oct 9, 2024 | In-Context Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| LLMs Are In-Context Bandit Reinforcement Learners | Oct 7, 2024 | In-Context LearningIn-Context Reinforcement Learning | —Unverified | 0 |
| ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Oct 3, 2024 | Few-Shot Imitation LearningImitation Learning | CodeCode Available | 1 |
| Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs | Oct 2, 2024 | In-Context Reinforcement Learningreinforcement-learning | —Unverified | 0 |