Is Pessimism Provably Efficient for Offline RL? Dec 30, 2020 Offline RL Reinforcement Learning (RL)
— Unverified 0Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning? Oct 12, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Is Q-Learning Provably Efficient? An Extended Analysis Sep 22, 2020 Q-Learning reinforcement-learning
— Unverified 0Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon Sep 28, 2020 Decision Making Multi-Armed Bandits
— Unverified 0Is RLHF More Difficult than Standard RL? Jun 25, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Issues concerning realizability of Blackwell optimal policies in reinforcement learning May 20, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Is the Bellman residual a bad proxy? Jun 24, 2016 reinforcement-learning Reinforcement Learning
— Unverified 0Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning Mar 4, 2024 Atari Games continuous-control
— Unverified 0Iterative Bounding MDPs: Learning Interpretable Policies via Non-Interpretable Methods Feb 25, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Iteratively Learn Diverse Strategies with State Distance Information Oct 23, 2023 Diversity Reinforcement Learning (RL)
— Unverified 0Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent Reinforcement Learning Nov 23, 2019 Image Segmentation Medical Image Segmentation
— Unverified 0Iterative Model-Based Reinforcement Learning Using Simulations in the Differentiable Neural Computer Jun 17, 2019 Lifelong learning Model-based Reinforcement Learning
— Unverified 0Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models Sep 18, 2017 Deep Reinforcement Learning Reinforcement Learning
— Unverified 0Iterative Policy-Space Expansion in Reinforcement Learning Dec 5, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Iterative Reachability Estimation for Safe Reinforcement Learning Sep 24, 2023 MuJoCo reinforcement-learning
— Unverified 0Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning Aug 16, 2019 Lifelong learning Multi-agent Reinforcement Learning
— Unverified 0IV-Posterior: Inverse Value Estimation for Interpretable Policy Certificates Nov 30, 2020 Reinforcement Learning (RL)
— Unverified 0J1: Exploring Simple Test-Time Scaling for LLM-as-a-Judge May 17, 2025 Reinforcement Learning (RL)
— Unverified 0J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization May 19, 2025 Reinforcement Learning (RL)
— Unverified 0Conditioning of Reinforcement Learning Agents and its Policy Regularization Application Jun 13, 2019 continuous-control Continuous Control
— Unverified 0"Jam Me If You Can'': Defeating Jammer with Deep Dueling Neural Network Architecture and Ambient Backscattering Augmented Communications Apr 8, 2019 Deep Reinforcement Learning Q-Learning
— Unverified 0Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning Apr 9, 2021 Collision Avoidance Decision Making
— Unverified 0JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading Aug 25, 2023 GPU reinforcement-learning
— Unverified 0Job Scheduling in Datacenters using Constraint Controlled RL Nov 10, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Data Centers Job Scheduling with Deep Reinforcement Learning Sep 16, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Joint Attention for Multi-Agent Coordination and Social Learning Apr 15, 2021 Deep Reinforcement Learning Inductive Bias
— Unverified 0Joint Band Assignment and Beam Management using Hierarchical Reinforcement Learning for Multi-Band Communication Aug 25, 2023 Hierarchical Reinforcement Learning Management
— Unverified 0Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment Jun 11, 2024 MuJoCo reinforcement-learning
— Unverified 0Joint Differentiable Optimization and Verification for Certified Reinforcement Learning Jan 28, 2022 Bilevel Optimization Model-based Reinforcement Learning
— Unverified 0Joint Energy Dispatch and Unit Commitment in Microgrids Based on Deep Reinforcement Learning Jun 3, 2022 Deep Reinforcement Learning energy management
— Unverified 0Joint Entity Linking with Deep Reinforcement Learning Feb 1, 2019 Deep Reinforcement Learning Entity Disambiguation
— Unverified 0Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation Jan 2, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Joint Inference of Reward Machines and Policies for Reinforcement Learning Sep 12, 2019 Q-Learning reinforcement-learning
— Unverified 0Joint Learning-Based Stabilization of Multiple Unknown Linear Systems Jan 1, 2022 Reinforcement Learning (RL)
— Unverified 0Joint Learning of Policy with Unknown Temporal Constraints for Safe Reinforcement Learning Apr 30, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Joint Learning of Reward Machines and Policies in Environments with Partially Known Semantics Apr 20, 2022 Q-Learning reinforcement-learning
— Unverified 0Jointly Reinforced User Simulator and Task-oriented Dialog System with Simplified Generative Architecture Jan 16, 2022 Language Modeling Language Modelling
— Unverified 0Jointly-Trained State-Action Embedding for Efficient Reinforcement Learning Sep 28, 2020 Model-based Reinforcement Learning Recommendation Systems
— Unverified 0Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment Mar 28, 2024 Reinforcement Learning (RL)
— Unverified 0Joint Modeling for Learning Decision-Making Dynamics in Behavioral Experiments Jun 3, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Joint Modeling for Query Expansion and Information Extraction with Reinforcement Learning Nov 1, 2018 Decision Making reinforcement-learning
— Unverified 0Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm May 28, 2021 Multi-Objective Reinforcement Learning reinforcement-learning
— Unverified 0Joint Power Allocation and Beamformer for mmW-NOMA Downlink Systems by Deep Reinforcement Learning May 13, 2022 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Joint Representation Training in Sequential Tasks with Shared Structure Jun 24, 2022 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Joint Resource Management for MC-NOMA: A Deep Reinforcement Learning Approach Mar 29, 2021 Deep Reinforcement Learning Management
— Unverified 0Joint Self-Supervised Learning for Vision-based Reinforcement Learning Sep 29, 2021 Autonomous Driving continuous-control
— Unverified 0Joint Sensing and Communications for Deep Reinforcement Learning-based Beam Management in 6G Aug 3, 2022 Clustering Deep Reinforcement Learning
— Unverified 0Jointly-Learned State-Action Embedding for Efficient Reinforcement Learning Oct 9, 2020 Model-based Reinforcement Learning Recommendation Systems
— Unverified 0Joint Synthesis of Safety Certificate and Safe Control Policy using Constrained Reinforcement Learning Nov 15, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0JudgeLRM: Large Reasoning Models as a Judge Mar 31, 2025 Reinforcement Learning (RL)
— Unverified 0