FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading Feb 17, 2025 Decision Making parameter-efficient fine-tuning
— Unverified 0Text Simplification with Reinforcement Learning Using Supervised Rewards on Grammaticality, Meaning Preservation, and Simplicity Dec 1, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0FORM: Learning Expressive and Transferable First-Order Logic Reward Machines Dec 31, 2024 Form Reinforcement Learning (RL)
— Unverified 0That Escalated Quickly: Compounding Complexity by Editing Levels at the Frontier of Agent Capabilities Sep 29, 2021 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0The act of remembering: a study in partially observable reinforcement learning Oct 5, 2020 Partially Observable Reinforcement Learning reinforcement-learning
— Unverified 0The Advantage Regret-Matching Actor-Critic Aug 27, 2020 counterfactual Reinforcement Learning (RL)
— Unverified 0The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI Feb 15, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems Dec 8, 2020 CPU Deep Reinforcement Learning
— Unverified 0The association problem in wireless networks: a Policy Gradient Reinforcement Learning approach Jun 11, 2013 Q-Learning reinforcement-learning
— Unverified 0The Bandit Whisperer: Communication Learning for Restless Bandits Aug 11, 2024 Reinforcement Learning (RL)
— Unverified 0The Best of Both Worlds: Reinforcement Learning with Logarithmic Regret and Policy Switches Mar 3, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond May 18, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach Jul 12, 2018 Deep Reinforcement Learning Model-based Reinforcement Learning
— Unverified 0The Case for Automatic Database Administration using Deep Reinforcement Learning Jan 17, 2018 Deep Reinforcement Learning reinforcement-learning
— Unverified 0The Central Role of the Loss Function in Reinforcement Learning Sep 19, 2024 Decision Making reinforcement-learning
— Unverified 0The Challenges of Exploration for Offline Reinforcement Learning Jan 27, 2022 Model Predictive Control Offline RL
— Unverified 0The Complexity of Markov Equilibrium in Stochastic Games Apr 8, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0The Complex Negotiation Dialogue Game Jul 5, 2017 One-Shot Learning Position
— Unverified 0The Concept of Criticality in Reinforcement Learning Oct 16, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning Jun 16, 2025 Deep Reinforcement Learning MuJoCo
— Unverified 0The Crucial Role of Problem Formulation in Real-World Reinforcement Learning Mar 26, 2025 Reinforcement Learning (RL)
— Unverified 0The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model May 26, 2023 Reinforcement Learning (RL)
— Unverified 0The Differences Between Direct Alignment Algorithms are a Blur Feb 3, 2025 Language Modeling Language Modelling
— Unverified 0The Difficulty of Passive Learning in Deep Reinforcement Learning Oct 26, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0The Ecosystem Path to General AI Aug 17, 2021 Reinforcement Learning (RL) Unity
— Unverified 0The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning Jun 23, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Tuning the Weights: The Impact of Initial Matrix Configurations on Successor Features Learning Efficacy Nov 3, 2021 Reinforcement Learning (RL) Representation Learning
— Unverified 0The effects of negative adaptation in Model-Agnostic Meta-Learning Dec 5, 2018 Few-Shot Learning Meta-Learning
— Unverified 0The Eigenoption-Critic Framework Dec 11, 2017 Efficient Exploration Hierarchical Reinforcement Learning
— Unverified 0The Emergence of Individuality in Multi-Agent Reinforcement Learning Sep 28, 2020 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0The Emergence of Wireless MAC Protocols with Multi-Agent Reinforcement Learning Aug 16, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0The End of Optimism? An Asymptotic Analysis of Finite-Armed Linear Bandits Oct 14, 2016 reinforcement-learning Reinforcement Learning
— Unverified 0The Essential Elements of Offline RL via Supervised Learning Sep 29, 2021 Offline RL reinforcement-learning
— Unverified 0The Evolution of Reinforcement Learning in Quantitative Finance: A Survey Aug 20, 2024 Meta-Learning reinforcement-learning
— Unverified 0The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning Feb 21, 2025 Decision Making reinforcement-learning
— Unverified 0The Exploratory Multi-Asset Mean-Variance Portfolio Selection using Reinforcement Learning May 12, 2025 Reinforcement Learning (RL)
— Unverified 0The Fallacy of Minimizing Cumulative Regret in the Sequential Task Setting Mar 16, 2024 Reinforcement Learning (RL)
— Unverified 0The False Dawn: Reevaluating Google's Reinforcement Learning for Chip Macro Placement Jun 16, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0The Feasibility of Constrained Reinforcement Learning Algorithms: A Tutorial Study Apr 15, 2024 Model Predictive Control reinforcement-learning
— Unverified 0The Frost Hollow Experiments: Pavlovian Signalling as a Path to Coordination and Communication Between Agents Mar 17, 2022 Decision Making reinforcement-learning
— Unverified 0The Gambler's Problem and Beyond Dec 31, 2019 Q-Learning reinforcement-learning
— Unverified 0The Gap Between Model-Based and Model-Free Methods on the Linear Quadratic Regulator: An Asymptotic Viewpoint Dec 9, 2018 continuous-control Continuous Control
— Unverified 0The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning with Efficient Communication Mar 24, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0The Greatest Teacher, Failure is: Using Reinforcement Learning for SFC Placement Based on Availability and Energy Consumption Oct 12, 2020 Reinforcement Learning (RL)
— Unverified 0The guide and the explorer: smart agents for resource-limited iterated batch reinforcement learning Sep 29, 2021 Acrobot Model Predictive Control
— Unverified 0The Hierarchical Adaptive Forgetting Variational Filter May 15, 2018 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0The Immersion of Directed Multi-graphs in Embedding Fields. Generalisations Apr 28, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Missing Velocity in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning Dec 23, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0The impact of moving expenses on social segregation: a simulation with RL and ABM Nov 22, 2022 Reinforcement Learning (RL)
— Unverified 0Transient Non-Stationarity and Generalisation in Deep Reinforcement Learning Jun 10, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0