| Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 | 0 |
| Constrained Online Decision-Making: A Unified Framework | May 11, 2025 | Active Learningcounterfactual | —Unverified | 0 | 0 |
| Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Jun 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Contextual Bandits for Unbounded Context Distributions | Aug 19, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Contextual Experience Replay for Self-Improvement of Language Agents | Jun 7, 2025 | Decision MakingLarge Language Model | —Unverified | 0 | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Continual Vision-and-Language Navigation | Mar 22, 2024 | Continual LearningNavigate | —Unverified | 0 | 0 |
| Continuous Episodic Control | Nov 28, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs | Jun 6, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning | Oct 22, 2024 | Decision MakingDiversity | —Unverified | 0 | 0 |
| Convex Regularization in Monte-Carlo Tree Search | Jul 1, 2020 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| Cooperative Bayesian Optimization for Imperfect Agents | Mar 7, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation SystemsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces | Jul 8, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Counterfactual Inference under Thompson Sampling | Apr 3, 2025 | Causal Inferencecounterfactual | —Unverified | 0 | 0 |
| Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | Jan 10, 2025 | Causal Inferencecounterfactual | —Unverified | 0 | 0 |
| Counterfactually Guided Off-policy Transfer in Clinical Settings | Jun 20, 2020 | counterfactualDecision Making | —Unverified | 0 | 0 |
| Counterfactual Strategies for Markov Decision Processes | May 14, 2025 | counterfactualDecision Making | —Unverified | 0 | 0 |
| CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System | May 19, 2024 | ChatbotLanguage Modeling | —Unverified | 0 | 0 |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal DiscoveryDecision Making | —Unverified | 0 | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Makingmodel | —Unverified | 0 | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | Jun 9, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Data-Efficient Reinforcement Learning for Malaria Control | May 4, 2021 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Data-efficient visuomotor policy training using reinforcement learning and generative models | Jul 26, 2020 | Decision MakingDisentanglement | —Unverified | 0 | 0 |
| DBA bandits: Self-driving index tuning under ad-hoc, analytical workloads with safety guarantees | Oct 19, 2020 | AttributeDecision Making | —Unverified | 0 | 0 |
| DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation | May 24, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Decentralized Cross-Entropy Method for Model-Based Reinforcement Learning | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances | Dec 9, 2019 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Deceptive Sequential Decision-Making via Regularized Policy Optimization | Jan 30, 2025 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Deciding What to Learn: A Rate-Distortion Approach | Jan 15, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning | Jun 4, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Decision-Focused Evaluation: Analyzing Performance of Deployed Restless Multi-Arm Bandits | Jan 19, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Decision making with dynamic probabilistic forecasts | Jun 30, 2021 | Decision Makingenergy trading | —Unverified | 0 | 0 |
| Deep Action Sequence Learning for Causal Shape Transformation | May 17, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time | Sep 20, 2021 | Decision Makingregression | —Unverified | 0 | 0 |
| Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games | Apr 24, 2016 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction | Mar 3, 2017 | Decision MakingDependency Parsing | —Unverified | 0 | 0 |
| Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching | Jan 24, 2019 | Decision MakingEfficient Exploration | —Unverified | 0 | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 | 0 |
| Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking | Oct 21, 2020 | Decision MakingFraud Detection | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for Adaptive Mesh Refinement | Sep 25, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for Entity Alignment | Nov 16, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications | Dec 31, 2018 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks | Apr 25, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module | Feb 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for Robust Goal-Based Wealth Management | Jul 25, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Selective Network Discovery via Deep Reinforcement Learning on Embedded Spaces | Sep 16, 2019 | AttributeDecision Making | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for Visual Object Tracking in Videos | Jan 31, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |