| Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity | Nov 7, 2024 | Diversity, Meta Reinforcement Learning | Code Available | 0 |
| FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system | Oct 28, 2024 | Code Generation, HumanEval | Code Available | 0 |
| Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement Learning | Oct 14, 2024 | Decision Making, Management | Code Available | 1 |
| Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator | Oct 13, 2024 | All, Bilevel Optimization | Unverified | 0 |
| Meta Reinforcement Learning Approach for Adaptive Resource Optimization in O-RAN | Sep 30, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Solving Truly Massive Budgeted Monotonic POMDPs with Oracle-Guided Meta-Reinforcement Learning | Aug 13, 2024 | Meta Reinforcement Learning | Unverified | 0 |
| Importance Sampling-Guided Meta-Training for Intelligent Agents in Highly Interactive Environments | Jul 22, 2024 | Meta Reinforcement Learning, Navigate | Unverified | 0 |
| Constrained Meta Agnostic Reinforcement Learning | Jun 20, 2024 | Meta-Learning, Meta Reinforcement Learning | Unverified | 0 |
| Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents | Jun 18, 2024 | Continuous Control | Unverified | 0 |
| Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning | Jun 7, 2024 | Contrastive Learning, Meta Reinforcement Learning | Code Available | 1 |
| Test-Time Regret Minimization in Meta Reinforcement Learning | Jun 4, 2024 | Meta Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| A CMDP-within-online framework for Meta-Safe Reinforcement Learning | May 26, 2024 | Meta-Learning, Meta Reinforcement Learning | Unverified | 0 |
| Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees | May 22, 2024 | Generalization Bounds, Meta Reinforcement Learning | Unverified | 0 |
| Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning | May 20, 2024 | Meta-Learning, Meta Reinforcement Learning | Code Available | 0 |
| On the Performance of Unmanned Aerial Vehicles with MIMO VLC | May 18, 2024 | Meta-Learning, Meta Reinforcement Learning | Unverified | 0 |
| Meta Reinforcement Learning for Resource Allocation in Multi-Antenna UAV Network with Rate Splitting Multiple Access | May 18, 2024 | Meta-Learning, Meta Reinforcement Learning | Unverified | 0 |
| Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Apr 10, 2024 | Decision Making, Meta Reinforcement Learning | Code Available | 0 |
| MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning | Mar 14, 2024 | Efficient Exploration, Mamba | Code Available | 1 |
| Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation | Mar 12, 2024 | Contrastive Learning, Data Augmentation | Code Available | 0 |
| SplAgger: Split Aggregation for Meta-Reinforcement Learning | Mar 5, 2024 | Continuous Control | Code Available | 1 |
| DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning | Feb 25, 2024 | Continuous Control | Unverified | 0 |
| Hierarchical Transformers are Efficient Meta-Reinforcement Learners | Feb 9, 2024 | Meta Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Analysing the Sample Complexity of Opponent Shaping | Feb 8, 2024 | Meta Reinforcement Learning | Unverified | 0 |
| In-context learning agents are asymmetric belief updaters | Feb 6, 2024 | Counterfactual, In-Context Learning | Unverified | 0 |
| Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning | Feb 5, 2024 | Meta Reinforcement Learning, Reinforcement Learning | Unverified | 0 |