| Variational Planning for Graph-based MDPs | Dec 1, 2013 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints | Feb 13, 2020 | Decision Making, Diagnostic | —Unverified | 0 | 0 |
| Vid2World: Crafting Video Diffusion Models to Interactive World Models | May 20, 2025 | Robot Manipulation, Sequential Decision Making | —Unverified | 0 | 0 |
| Video Summarisation by Classification with Deep Reinforcement Learning | Jul 9, 2018 | Classification, Decision Making | —Unverified | 0 | 0 |
| VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | Mar 19, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images | Apr 14, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Visual Encoders for Data-Efficient Imitation Learning in Modern Video Games | Dec 4, 2023 | Atari Games, Decision Making | —Unverified | 0 | 0 |
| Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark | Sep 13, 2024 | Sequential Decision Making, World Knowledge | —Unverified | 0 | 0 |
| VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | Feb 4, 2025 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning | Feb 11, 2025 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Weakly-supervised Multi-output Regression via Correlated Gaussian Processes | Feb 19, 2020 | Decision Making, Gaussian Processes | —Unverified | 0 | 0 |
| Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs | Mar 18, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| When is Particle Filtering Efficient for Planning in Partially Observed Linear Dynamical Systems? | Jun 10, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| When to Call Your Neighbor? Strategic Communication in Cooperative Stochastic Bandits | Oct 8, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance | Dec 4, 2022 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Working Memory Graphs | Nov 17, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations | Apr 10, 2022 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| You Can Trade Your Experience in Distributed Multi-Agent Multi-Armed Bandits | Jun 19, 2023 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| A Trainable Approach to Zero-delay Smoothing Spline Interpolation | Mar 7, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Zero-Shot Action Generalization with Limited Observations | Mar 11, 2025 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Statistical Inference with M-Estimators on Adaptively Collected Data | Apr 29, 2021 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| Few-shot Scooping Under Domain Shift via Simulated Maximal Deployment Gaps | Aug 6, 2024 | Bayesian Optimization, Meta-Learning | —Unverified | 0 | 0 |
| Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning | Mar 3, 2025 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | May 16, 2025 | Equation Discovery, reinforcement-learning | —Unverified | 0 | 0 |
| How to Provably Improve Return Conditioned Supervised Learning? | Jun 10, 2025 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection | Mar 14, 2018 | Combinatorial Optimization, Decision Making | —Unverified | 0 | 0 |
| Accelerating exploration and representation learning with offline pre-training | Mar 31, 2023 | Decision Making, NetHack | —Unverified | 0 | 0 |
| Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization | Jun 23, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | Sep 17, 2021 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | Feb 7, 2020 | Decision Making, reinforcement-learning | —Unverified | 0 | 0 |
| A Classification View on Meta Learning Bandits | Apr 6, 2025 | Classification, Meta-Learning | —Unverified | 0 | 0 |
| A Computational Framework for Motor Skill Acquisition | Jan 3, 2019 | Decision Making, Reinforcement Learning | —Unverified | 0 | 0 |
| A Contextual Bandit Approach for Stream-Based Active Learning | Jan 24, 2017 | Active Learning, Decision Making | —Unverified | 0 | 0 |
| Action Set Based Policy Optimization for Safe Power Grid Management | Jun 29, 2021 | Decision Making, Management | —Unverified | 0 | 0 |
| Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation | May 29, 2025 | Decision Making, Hallucination | —Unverified | 0 | 0 |
| Active Learning-Based Multistage Sequential Decision-Making Model with Application on Common Bile Duct Stone Evaluation | Jan 13, 2022 | Active Learning, Decision Making | —Unverified | 0 | 0 |
| Active Learning for Accurate Estimation of Linear Models | Mar 2, 2017 | Active Learning, Decision Making | —Unverified | 0 | 0 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learning, continuous-control | —Unverified | 0 | 0 |
| Active Sensing as Bayes-Optimal Sequential Decision Making | Aug 9, 2014 | Decision Making, Sensitivity | —Unverified | 0 | 0 |
| Active Sensing as Bayes-Optimal Sequential Decision Making | May 28, 2013 | Decision Making, Sensitivity | —Unverified | 0 | 0 |
| Actor-Critic Algorithms for Risk-Sensitive MDPs | Dec 1, 2013 | Decision Making, Sequential Decision Making | —Unverified | 0 | 0 |
| Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems | Apr 14, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Actual Causality and Responsibility Attribution in Decentralized Partially Observable Markov Decision Processes | Apr 1, 2022 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 | 0 |
| A Customizable Generator for Comic-Style Visual Narrative | Dec 14, 2023 | ARC, Decision Making | —Unverified | 0 | 0 |
| adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | Mar 7, 2023 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Adaptive Exploration in Linear Contextual Bandit | Oct 15, 2019 | Decision Making, Multi-Armed Bandits | —Unverified | 0 | 0 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | May 27, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |