| Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces | Mar 29, 2024 | Decision MakingMamba | CodeCode Available | 1 |
| Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer | Mar 12, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision | Mar 10, 2024 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| How Can LLM Guide RL? A Value-Based Approach | Feb 25, 2024 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 1 |
| PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Feb 16, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Feb 9, 2024 | Computational Efficiencycontinuous-control | CodeCode Available | 1 |
| Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Feb 7, 2024 | Decision Makingregression | CodeCode Available | 1 |
| Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills | Feb 5, 2024 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 1 |
| LLF-Bench: Benchmark for Interactive Learning from Language Feedback | Dec 11, 2023 | Information RetrievalOpenAI Gym | CodeCode Available | 1 |