| Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Bandit Convex Optimization in Non-stationary Environments | Jul 29, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision MakingManagement | —Unverified | 0 |
| A Classification View on Meta Learning Bandits | Apr 6, 2025 | ClassificationMeta-Learning | —Unverified | 0 |
| Bandit based centralized matching in two-sided markets for peer to peer lending | May 6, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A modular framework for object-based saccadic decisions in dynamic scenes | Jun 10, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adaptive Exploration in Linear Contextual Bandit | Oct 15, 2019 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| AVID: Adapting Video Diffusion Models to World Models | Oct 1, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Auxiliary Reward Generation with Transition Distance Representation Learning | Feb 12, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| A Mini Review on the utilization of Reinforcement Learning with OPC UA | May 24, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |