Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions Mar 30, 2023 Diversity Offline RL
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning May 16, 2024 Decision Making Instruction Following
Fine-Tuning Next-Scale Visual Autoregressive Models with Group Relative Policy Optimization May 29, 2025 Reinforcement Learning (RL)
Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization Jan 1, 2021 D4RL MuJoCo
Finetuning Offline World Models in the Real World Oct 24, 2023 Offline RL Reinforcement Learning (RL)
Fingerprint Policy Optimisation for Robust Reinforcement Learning May 27, 2018 Bayesian Optimisation Continuous Control
Finite Horizon Q-learning: Stability, Convergence, Simulations and an application on Smart Grids Oct 27, 2021 Q-Learning reinforcement-learning
Finite-Sample Analysis For Decentralized Batch Multi-Agent Reinforcement Learning With Networked Agents Dec 6, 2018 Multi-agent Reinforcement Learning reinforcement-learning
Finite Sample Analyses for TD(0) with Function Approximation Apr 4, 2017 reinforcement-learning Reinforcement Learning
Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation Apr 7, 2021 reinforcement-learning Reinforcement Learning
Finite-sample Analysis of Greedy-GQ with Linear Function Approximation under Markovian Noise May 20, 2020 reinforcement-learning Reinforcement Learning
Finite Sample Analysis of LSTD with Random Projections and Eligibility Traces May 25, 2018 reinforcement-learning Reinforcement Learning
Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency Feb 5, 2021 Off-policy evaluation reinforcement-learning
Finite-Sample Analysis of Stochastic Approximation Using Smooth Convex Envelopes Feb 3, 2020 Q-Learning Reinforcement Learning
Finite Sample Analysis of the GTD Policy Evaluation Algorithms in Markov Setting Sep 21, 2018 reinforcement-learning Reinforcement Learning
Finite Sample Analysis of Two-Timescale Stochastic Approximation with Applications to Reinforcement Learning Mar 15, 2017 reinforcement-learning Reinforcement Learning
Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games May 28, 2025 Decision Making Reinforcement Learning (RL)
Finite-time Analysis of Approximate Policy Iteration for the Linear Quadratic Regulator May 30, 2019 continuous-control Continuous Control
Final Iteration Convergence Bound of Q-Learning: Switching System Approach May 11, 2022 Q-Learning reinforcement-learning
Finite Time Analysis of Linear Two-timescale Stochastic Approximation with Markovian Noise Feb 4, 2020 Reinforcement Learning Reinforcement Learning (RL)
Finite-Time Analysis of Simultaneous Double Q-learning Jun 14, 2024 Q-Learning Reinforcement Learning (RL)
Finite-Time Analysis of Stochastic Gradient Descent under Markov Randomness Mar 24, 2020 reinforcement-learning Reinforcement Learning
Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward Sep 29, 2021 Multi-agent Reinforcement Learning reinforcement-learning
Finite-Time Error Bounds for Greedy-GQ Sep 6, 2022 reinforcement-learning Reinforcement Learning
FinRL: Deep Reinforcement Learning Framework to Automate Trading in Quantitative Finance Nov 7, 2021 Deep Reinforcement Learning Friction
FinRL-Podracer: High Performance and Scalable Deep Reinforcement Learning for Quantitative Finance Nov 7, 2021 Deep Reinforcement Learning GPU
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations Sep 28, 2022 Autonomous Driving Edge-computing
First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation Dec 6, 2022 continuous-control Continuous Control
First-Order Problem Solving through Neural MCTS based Reinforcement Learning Jan 11, 2021 reinforcement-learning Reinforcement Learning
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach Dec 7, 2021 Decision Making reinforcement-learning
First-Person Activity Forecasting with Online Inverse Reinforcement Learning Dec 22, 2016 reinforcement-learning Reinforcement Learning
First-spike based visual categorization using reward-modulated STDP May 25, 2017 Game of Go Object Recognition
First Three Years of the International Verification of Neural Networks Competition (VNN-COMP) Jan 14, 2023 image-classification Image Classification
BSODA: A Bipartite Scalable Framework for Online Disease Diagnosis Dec 2, 2020 Disease Prediction Reinforcement Learning (RL)
FitLight: Federated Imitation Learning for Plug-and-Play Autonomous Traffic Signal Control Feb 17, 2025 Imitation Learning reinforcement-learning
Fitted Q-iteration in continuous action-space MDPs Dec 1, 2007 reinforcement-learning Reinforcement Learning
Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have Dec 1, 2022 Decision Making reinforcement-learning
FIXAR: A Fixed-Point Deep Reinforcement Learning Platform with Quantization-Aware Training and Adaptive Parallelism Feb 24, 2021 CPU Deep Reinforcement Learning
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning Sep 9, 2019 Q-Learning reinforcement-learning
Fixed Points in Cyber Space: Rethinking Optimal Evasion Attacks in the Age of AI-NIDS Nov 23, 2021 Continual Learning Multi-agent Reinforcement Learning
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs Jun 18, 2020 reinforcement-learning Reinforcement Learning (RL)
FLAME: Factuality-Aware Alignment for Large Language Models May 2, 2024 Hallucination Instruction Following
FLAM: Foundation Model-Based Body Stabilization for Humanoid Locomotion and Manipulation Mar 28, 2025 Reinforcement Learning (RL)
FlashRL: A Reinforcement Learning Platform for Flash Games Jan 26, 2018 CPU Diversity
Flatland: a Lightweight First-Person 2-D Environment for Reinforcement Learning Sep 3, 2018 Lifelong learning reinforcement-learning
Flatland-RL : Multi-Agent Reinforcement Learning on Trains Dec 10, 2020 Imitation Learning Multi-agent Reinforcement Learning
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation Mar 17, 2025 Imitation Learning Object
Flexible and Efficient Long-Range Planning Through Curious Exploration Apr 22, 2020 Deep Reinforcement Learning Imitation Learning
Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback Jan 27, 2025 Offline RL Reinforcement Learning (RL)
Flexible Multiple-Objective Reinforcement Learning for Chip Placement Apr 13, 2022 Diversity reinforcement-learning