| Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Jun 6, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| Learning Embeddings for Sequential Tasks Using Population of Agents | Jun 5, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Makingmodel | —Unverified | 0 |
| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 1 |
| STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 2 |
| Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making | May 27, 2023 | Adversarial AttackDecision Making | CodeCode Available | 0 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision MakingHallucination | CodeCode Available | 1 |
| Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds | May 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adversarial Attacks on Online Learning to Rank with Click Feedback | May 26, 2023 | Decision MakingLearning-To-Rank | —Unverified | 0 |
| Self-Supervised Reinforcement Learning that Transfers using Random Features | May 26, 2023 | Decision MakingModel Predictive Control | —Unverified | 0 |