Second-Order Bounds for [0,1]-Valued Regression via Betting Loss Jul 16, 2025 Distributional Reinforcement Learning regression
— Unverified 0Distributional Reinforcement Learning on Path-dependent Options Jul 16, 2025 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0CTRLS: Chain-of-Thought Reasoning via Latent State-Transition Jul 10, 2025 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0ADDQ: Adaptive Distributional Double Q-Learning Jun 24, 2025 Distributional Reinforcement Learning MuJoCo
Code Code Available 0A Point-Based Algorithm for Distributional Reinforcement Learning in Partially Observable Domains May 10, 2025 Decision Making Distributional Reinforcement Learning
— Unverified 0Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning May 7, 2025 Distributional Reinforcement Learning
— Unverified 0Deep Distributional Learning with Non-crossing Quantile Network Apr 11, 2025 Distributional Reinforcement Learning quantile regression
— Unverified 0Offline and Distributional Reinforcement Learning for Wireless Communications Apr 4, 2025 Distributional Reinforcement Learning Management
— Unverified 0RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning Feb 27, 2025 Distributional Reinforcement Learning Imitation Learning
Code Code Available 0Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management Feb 25, 2025 Distributional Reinforcement Learning Management
— Unverified 0A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation Feb 20, 2025 Distributional Reinforcement Learning
— Unverified 0Robust Probabilistic Model Checking with Continuous Reward Domains Feb 6, 2025 Distributional Reinforcement Learning model
— Unverified 0Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics Jan 21, 2025 Distributional Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 0Risk-averse policies for natural gas futures trading using distributional reinforcement learning Jan 8, 2025 Distributional Reinforcement Learning energy trading
— Unverified 0Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning Jan 3, 2025 Decision Making Distributional Reinforcement Learning
Code Code Available 0Hedging and Pricing Structured Products Featuring Multiple Underlying Assets Nov 2, 2024 Distributional Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning Oct 14, 2024 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space Oct 2, 2024 Decision Making Distributional Reinforcement Learning
— Unverified 0Offline and Distributional Reinforcement Learning for Radio Resource Management Sep 25, 2024 Distributional Reinforcement Learning Management
— Unverified 0Foundations of Multivariate Distributional Reinforcement Learning Aug 31, 2024 Decision Making Distributional Reinforcement Learning
— Unverified 0EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning Aug 22, 2024 Distributional Reinforcement Learning quantile regression
Code Code Available 0Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation Jul 31, 2024 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0On Policy Evaluation Algorithms in Distributional Reinforcement Learning Jul 19, 2024 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods Jul 18, 2024 Atari Games Decision Making
— Unverified 0Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence May 23, 2024 Distributional Reinforcement Learning Policy Gradient Methods
— Unverified 0CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics May 4, 2024 continuous-control Continuous Control
Code Code Available 0Statistical Efficiency of Distributional Temporal Difference Learning and Freedman's Inequality in Hilbert Spaces Mar 9, 2024 Distributional Reinforcement Learning
— Unverified 0Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation Feb 28, 2024 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Uncertainty-Aware Transient Stability-Constrained Preventive Redispatch: A Distributional Reinforcement Learning Approach Feb 14, 2024 Deep Reinforcement Learning Distributional Reinforcement Learning
— Unverified 0A Distributional Analogue to the Successor Representation Feb 13, 2024 Distributional Reinforcement Learning Model-based Reinforcement Learning
Code Code Available 1Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model Feb 12, 2024 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning Feb 11, 2024 Distributional Reinforcement Learning Multi-Armed Bandits
— Unverified 0Echoes of Socratic Doubt: Embracing Uncertainty in Calibrated Evidential Reinforcement Learning Feb 11, 2024 Atari Games Distributional Reinforcement Learning
Code Code Available 0Distributional Off-policy Evaluation with Bellman Residual Minimization Feb 2, 2024 Distributional Reinforcement Learning Off-policy evaluation
Code Code Available 0A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement Learning Jan 4, 2024 Atari Games Distributional Reinforcement Learning
Code Code Available 0Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism Dec 23, 2023 Distributional Reinforcement Learning Q-Learning
— Unverified 0Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning Dec 12, 2023 Distributional Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Distributional Bellman Operators over Mean Embeddings Dec 9, 2023 Atari Games Deep Reinforcement Learning
Code Code Available 0An introduction to reinforcement learning for neuroscience Nov 13, 2023 Deep Reinforcement Learning Distributional Reinforcement Learning
— Unverified 0Beyond Average Return in Markov Decision Processes Oct 31, 2023 Distributional Reinforcement Learning
— Unverified 0Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion Oct 25, 2023 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Distributional Reinforcement Learning with Online Risk-awareness Adaption Oct 8, 2023 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Estimation and Inference in Distributional Reinforcement Learning Sep 29, 2023 Distributional Reinforcement Learning reinforcement-learning
Code Code Available 0Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning Sep 25, 2023 Distributional Reinforcement Learning reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Artificial Upwelling Energy Management Aug 20, 2023 Deep Reinforcement Learning Distributional Reinforcement Learning
— Unverified 0Value-Distributional Model-Based Reinforcement Learning Aug 12, 2023 continuous-control Continuous Control
Code Code Available 0Variance Control for Distributional Reinforcement Learning Jul 30, 2023 Distributional Reinforcement Learning MuJoCo
Code Code Available 0Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent Jul 13, 2023 Distributional Reinforcement Learning
— Unverified 0Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning Jul 4, 2023 Distributional Reinforcement Learning model
Code Code Available 0Is Risk-Sensitive Reinforcement Learning Properly Resolved? Jul 2, 2023 Distributional Reinforcement Learning Management
— Unverified 0