| Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 | 0 |
| Constrained Online Decision-Making: A Unified Framework | May 11, 2025 | Active Learningcounterfactual | —Unverified | 0 | 0 |
| Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Jun 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Contextual Bandits for Unbounded Context Distributions | Aug 19, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Contextual Experience Replay for Self-Improvement of Language Agents | Jun 7, 2025 | Decision MakingLarge Language Model | —Unverified | 0 | 0 |
| Contextual Online Decision Making with Infinite-Dimensional Functional Regression | Jan 30, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Continual Vision-and-Language Navigation | Mar 22, 2024 | Continual LearningNavigate | —Unverified | 0 | 0 |
| Continuous Episodic Control | Nov 28, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs | Jun 6, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Convex Markov Games: A New Frontier for Multi-Agent Reinforcement Learning | Oct 22, 2024 | Decision MakingDiversity | —Unverified | 0 | 0 |
| Convex Regularization in Monte-Carlo Tree Search | Jul 1, 2020 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| Cooperative Bayesian Optimization for Imperfect Agents | Mar 7, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation SystemsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces | Jul 8, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Counterfactual Inference under Thompson Sampling | Apr 3, 2025 | Causal Inferencecounterfactual | —Unverified | 0 | 0 |
| Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | Jan 10, 2025 | Causal Inferencecounterfactual | —Unverified | 0 | 0 |
| Counterfactually Guided Off-policy Transfer in Clinical Settings | Jun 20, 2020 | counterfactualDecision Making | —Unverified | 0 | 0 |
| Counterfactual Strategies for Markov Decision Processes | May 14, 2025 | counterfactualDecision Making | —Unverified | 0 | 0 |
| CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System | May 19, 2024 | ChatbotLanguage Modeling | —Unverified | 0 | 0 |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection | May 4, 2025 | Causal DiscoveryDecision Making | —Unverified | 0 | 0 |
| Data-Driven Online Model Selection With Regret Guarantees | Jun 5, 2023 | Decision Makingmodel | —Unverified | 0 | 0 |
| Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits | Jun 9, 2024 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Data-Efficient Reinforcement Learning for Malaria Control | May 4, 2021 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |