The Role of Diverse Replay for Generalisation in Reinforcement Learning Jun 9, 2023 Diversity reinforcement-learning
The Role of Environment Access in Agnostic Reinforcement Learning Apr 7, 2025 reinforcement-learning Reinforcement Learning
The Role of Exploration for Task Transfer in Reinforcement Learning Oct 11, 2022 reinforcement-learning Reinforcement Learning
The Role of Time Delay in Sim2real Transfer of Reinforcement Learning for Cyber-Physical Systems Sep 30, 2022 Reinforcement Learning (RL)
The Sample-Complexity of General Reinforcement Learning Aug 22, 2013 General Reinforcement Learning reinforcement-learning
The Skill-Action Architecture: Learning Abstract Action Embeddings for Reinforcement Learning Jan 1, 2021 Hierarchical Reinforcement Learning reinforcement-learning
The State-Action-Reward-State-Action Algorithm in Spatial Prisoner's Dilemma Game Jun 25, 2024 Decision Making Imitation Learning
The Statistical Complexity of Interactive Decision Making Dec 27, 2021 Decision Making reinforcement-learning
The Steganographic Potentials of Language Models May 6, 2025 Reinforcement Learning (RL)
The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning Nov 15, 2024 Diversity Model-based Reinforcement Learning
The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs Jul 10, 2025 Multimodal Reasoning Reinforcement Learning (RL)
Theta-Resonance: A Single-Step Reinforcement Learning Method for Design Space Exploration Nov 3, 2022 Deep Reinforcement Learning reinforcement-learning
The Sample Complexity of Teaching-by-Reinforcement on Q-Learning Jun 16, 2020 Q-Learning reinforcement-learning
The tree reconstruction game: phylogenetic reconstruction using reinforcement learning Mar 12, 2023 Q-Learning reinforcement-learning
The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task Aug 3, 2017 Domain Adaptation Machine Translation
The Utility of Sparse Representations for Control in Reinforcement Learning Nov 15, 2018 reinforcement-learning Reinforcement Learning
The Value Equivalence Principle for Model-Based Reinforcement Learning Nov 6, 2020 Model-based Reinforcement Learning reinforcement-learning
The Value Function Polytope in Reinforcement Learning Jan 31, 2019 reinforcement-learning Reinforcement Learning
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning Jun 3, 2020 Atari Games reinforcement-learning
The Value of Reward Lookahead in Reinforcement Learning Mar 18, 2024 Offline RL reinforcement-learning
The Virtues of Pessimism in Inverse Reinforcement Learning Feb 4, 2024 Offline RL reinforcement-learning
The wisdom of the crowd: reliable deep reinforcement learning through ensembles of Q-functions Sep 27, 2018 Deep Reinforcement Learning Policy Gradient Methods
Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL Apr 21, 2025 Reinforcement Learning (RL) Zero-Shot Learning
Thinking While Moving: Deep Reinforcement Learning with Concurrent Control Apr 13, 2020 Deep Reinforcement Learning reinforcement-learning
Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains May 22, 2025 Mathematical Reasoning Reinforcement Learning (RL)
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Mar 25, 2025 Math Reinforcement Learning (RL)
Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints Nov 2, 2019 Bayesian Optimization Decision Making
Thompson Sampling for Learning Parameterized Markov Decision Processes Jun 29, 2014 Form reinforcement-learning
Thompson Sampling is Asymptotically Optimal in General Environments Feb 25, 2016 reinforcement-learning Reinforcement Learning
Thompson Sampling on Asymmetric α-Stable Bandits Mar 19, 2022 reinforcement-learning Reinforcement Learning (RL)
Thompson Sampling with a Mixture Prior Jun 10, 2021 Decision Making Multi-Task Learning
Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities May 21, 2025 Math Reinforcement Learning (RL)
Throughput Optimization for Grant-Free Multiple Access With Multiagent Deep Reinforcement Learning Feb 1, 2021 Deep Reinforcement Learning reinforcement-learning
Through the Valley: Path to Effective Long CoT Training for Small Language Models Jun 9, 2025 8k Reinforcement Learning (RL)
Tight Bayesian Ambiguity Sets for Robust MDPs Nov 15, 2018 Decision Making Reinforcement Learning
Tightening Exploration in Upper Confidence Reinforcement Learning Apr 20, 2020 reinforcement-learning Reinforcement Learning
Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds Jan 1, 2019 Learning Theory Reinforcement Learning
Tight Finite Time Bounds of Two-Time-Scale Linear Stochastic Approximation with Markovian Noise Dec 31, 2023 Reinforcement Learning (RL)
Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient Jan 19, 2023 Decision Making reinforcement-learning
Tile Networks: Learning Optimal Geometric Layout for Whole-page Recommendation Mar 3, 2023 Learning-To-Rank reinforcement-learning
Time Adaptive Reinforcement Learning Apr 18, 2020 reinforcement-learning Reinforcement Learning
Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning May 6, 2021 Deep Reinforcement Learning reinforcement-learning
Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning Feb 17, 2021 Data Augmentation Deep Reinforcement Learning
Time-Scale Separation in Q-Learning: Extending TD(Δ) for Action-Value Function Decomposition Nov 21, 2024 Q-Learning Reinforcement Learning (RL)
Time-Variant Variational Transfer for Value Functions May 26, 2020 reinforcement-learning Reinforcement Learning (RL)
Time your hedge with Deep Reinforcement Learning Sep 16, 2020 Deep Reinforcement Learning reinforcement-learning
Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints May 31, 2022 Reinforcement Learning (RL)
Timing Process Interventions with Causal Inference and Reinforcement Learning Jun 7, 2023 Causal Inference reinforcement-learning
tinyMAN: Lightweight Energy Manager using Reinforcement Learning for Energy Harvesting Wearable IoT Devices Feb 18, 2022 energy management Management
To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs Jun 11, 2021 Question Generation Question-Generation