Reinforcement Learning in a Birth and Death Process: Breaking the Dependence on the State Space Feb 21, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management Feb 21, 2023 Dialogue Management Diversity
— Unverified 0Reinforcement Learning-based Control of Nonlinear Systems using Carleman Approximation: Structured and Unstructured Designs Feb 21, 2023 Reinforcement Learning (RL)
— Unverified 0MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization Feb 21, 2023 Decision Making Multi-agent Reinforcement Learning
Code Code Available 0Robust Auto-landing Control of an agile Regional Jet Using Fuzzy Q-learning Feb 21, 2023 Q-Learning reinforcement-learning
— Unverified 0Towards a Sustainable Internet-of-Underwater-Things based on AUVs, SWIPT, and Reinforcement Learning Feb 21, 2023 Decision Making Reinforcement Learning (RL)
— Unverified 0BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT Feb 21, 2023 Backdoor Attack Language Modeling
— Unverified 0Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning Feb 21, 2023 Decision Making Hierarchical Reinforcement Learning
— Unverified 0Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes Feb 21, 2023 Learning Theory Medical Diagnosis
— Unverified 0Constrained Reinforcement Learning for Predictive Control in Real-Time Stochastic Dynamic Optimal Power Flow Feb 21, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Curiosity-driven Exploration in Sparse-reward Multi-agent Reinforcement Learning Feb 21, 2023 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Adversarial Model for Offline Reinforcement Learning Feb 21, 2023 model reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Robotic Pushing and Picking in Cluttered Environment Feb 21, 2023 Deep Reinforcement Learning Object
— Unverified 0A Reinforcement Learning Framework for Online Speaker Diarization Feb 21, 2023 Decision Making Domain Adaptation
— Unverified 0Backstepping Temporal Difference Learning Feb 20, 2023 Reinforcement Learning (RL)
— Unverified 0Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems Feb 20, 2023 Learning-To-Rank Reinforcement Learning (RL)
Code Code Available 0Differentiable Arbitrating in Zero-sum Markov Games Feb 20, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0DC4L: Distribution Shift Recovery via Data-Driven Control for Deep Learning Models Feb 20, 2023 Data Augmentation Dimensionality Reduction
Code Code Available 0Reinforcement Learning with Function Approximation: From Linear to Nonlinear Feb 20, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Safe Deep Reinforcement Learning by Verifying Task-Level Properties Feb 20, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Multiagent Inverse Reinforcement Learning via Theory of Mind Reasoning Feb 20, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Robust and Versatile Bipedal Jumping Control through Reinforcement Learning Feb 19, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Generalization in Visual Reinforcement Learning with the Reward Sequence Distribution Feb 19, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Compositionality and Bounds for Optimal Value Functions in Reinforcement Learning Feb 19, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Interactive Video Corpus Moment Retrieval using Reinforcement Learning Feb 19, 2023 Moment Retrieval reinforcement-learning
— Unverified 0AutoDOViz: Human-Centered Automation for Decision Optimization Feb 19, 2023 AutoML reinforcement-learning
— Unverified 0Auto.gov: Learning-based Governance for Decentralized Finance (DeFi) Feb 19, 2023 Reinforcement Learning (RL)
Code Code Available 0HOPE: Human-Centric Off-Policy Evaluation for E-Learning and Healthcare Feb 18, 2023 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Effective Multimodal Reinforcement Learning with Modality Alignment and Importance Enhancement Feb 18, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization Feb 18, 2023 Deep Reinforcement Learning Efficient Exploration
— Unverified 0Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer Feb 18, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Promoting Cooperation in Multi-Agent Reinforcement Learning via Mutual Help Feb 18, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation Feb 18, 2023 Instruction Following Reinforcement Learning (RL)
— Unverified 0Post Reinforcement Learning Inference Feb 17, 2023 counterfactual Off-policy evaluation
Code Code Available 0Robot path planning using deep reinforcement learning Feb 17, 2023 Autonomous Navigation Deep Reinforcement Learning
— Unverified 0Deep Reinforcement Learning for mmWave Initial Beam Alignment Feb 17, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0A State Augmentation based approach to Reinforcement Learning from Human Preferences Feb 17, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Mixed Traffic Control and Coordination from Pixels Feb 17, 2023 Reinforcement Learning (RL)
— Unverified 0Exploiting Unlabeled Data for Feedback Efficient Human Preference based Reinforcement Learning Feb 17, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Learning to Forecast Aleatoric and Epistemic Uncertainties over Long Horizon Trajectories Feb 17, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Data Driven Reward Initialization for Preference based Reinforcement Learning Feb 17, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Tuning computer vision models with task rewards Feb 16, 2023 Colorization Image Captioning
— Unverified 0Quantum Computing Provides Exponential Regret Improvement in Episodic Reinforcement Learning Feb 16, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes Feb 15, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Meta-Reinforcement Learning via Exploratory Task Clustering Feb 15, 2023 Clustering Meta Reinforcement Learning
— Unverified 0Prioritized offline Goal-swapping Experience Replay Feb 15, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning Based Power Grid Day-Ahead Planning and AI-Assisted Control Feb 15, 2023 Management reinforcement-learning
— Unverified 0Scalable Multi-Agent Reinforcement Learning with General Utilities Feb 15, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0CERiL: Continuous Event-based Reinforcement Learning Feb 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications Feb 15, 2023 Decision Making Management
— Unverified 0