| Vintix: Action Model via In-Context Reinforcement Learning | Jan 31, 2025 | Decision MakingIn-Context Reinforcement Learning | CodeCode Available | 1 | 5 |
| Free Random Projection for In-Context Reinforcement Learning | Apr 9, 2025 | In-Context Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Jun 13, 2024 | GPUIn-Context Learning | CodeCode Available | 0 | 5 |
| An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising | Jan 26, 2025 | In-Context Reinforcement LearningSequential Decision Making | CodeCode Available | 0 | 5 |
| Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack | Oct 9, 2024 | In-Context Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| HVAC-DPT: A Decision Pretrained Transformer for HVAC Control | Nov 29, 2024 | In-Context Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Yes, Q-learning Helps Offline In-Context RL | Feb 24, 2025 | In-Context Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Supervised Pretraining Can Learn In-Context Reinforcement Learning | Jun 26, 2023 | Decision MakingIn-Context Learning | —Unverified | 0 | 0 |
| LLMs Are In-Context Bandit Reinforcement Learners | Oct 7, 2024 | In-Context LearningIn-Context Reinforcement Learning | —Unverified | 0 | 0 |
| OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds | Feb 5, 2025 | Few-Shot LearningImitation Learning | —Unverified | 0 | 0 |