Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic | Jan 28, 2023 | Reinforcement Learning (RL) | Unverified
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning | Jan 28, 2023 | reinforcement-learning Reinforcement Learning | Code Available
A Memory Efficient Deep Reinforcement Learning Approach For Snake Game Autonomous Agents | Jan 27, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified
Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence | Jan 27, 2023 | Atari Games reinforcement-learning | Unverified
Exploring Deep Reinforcement Learning for Holistic Smart Building Control | Jan 27, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified
Improving Behavioural Cloning with Positive Unlabeled Learning | Jan 27, 2023 | Behavioural cloning D4RL | Unverified
Reinforcement Learning from Diverse Human Preferences | Jan 27, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified
Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties | Jan 27, 2023 | reinforcement-learning Reinforcement Learning (RL) | Unverified
Modeling human road crossing decisions as reward maximization with visual perception limitations | Jan 27, 2023 | Deep Reinforcement Learning Reinforcement Learning (RL) | Unverified
Single-Trajectory Distributionally Robust Reinforcement Learning | Jan 27, 2023 | Decision Making Q-Learning | Unverified
SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning | Jan 27, 2023 | 3D Reconstruction NeRF | Unverified
Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout | Jan 26, 2023 | MuJoCo reinforcement-learning | Code Available
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning | Jan 26, 2023 | reinforcement-learning Reinforcement Learning | Code Available
Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons | Jan 26, 2023 | reinforcement-learning Reinforcement Learning | Unverified
Model-based Offline Reinforcement Learning with Local Misspecification | Jan 26, 2023 | D4RL model | Unverified
Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation | Jan 26, 2023 | Adversarial Robustness MuJoCo | Unverified
On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision Making Policy Gradient Methods | Unverified
Learning from Multiple Independent Advisors in Multi-agent Reinforcement Learning | Jan 26, 2023 | Multi-agent Reinforcement Learning Q-Learning | Code Available
Learning to Generate All Feasible Actions | Jan 26, 2023 | All Reinforcement Learning (RL) | Unverified
FedHQL: Federated Heterogeneous Q-Learning | Jan 26, 2023 | Q-Learning reinforcement-learning | Unverified
Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits | Jan 26, 2023 | Multi-agent Reinforcement Learning Multi-Armed Bandits | Unverified
A Deep Neural Network Algorithm for Linear-Quadratic Portfolio Optimization with MGARCH and Small Transaction Costs | Jan 25, 2023 | Portfolio Optimization Reinforcement Learning (RL) | Unverified
ASQ-IT: Interactive Explanations for Reinforcement-Learning Agents | Jan 24, 2023 | reinforcement-learning Reinforcement Learning | Unverified
Explainable Deep Reinforcement Learning: State of the Art and Challenges | Jan 24, 2023 | Decision Making Deep Reinforcement Learning | Unverified
Autonomous particles | Jan 24, 2023 | Autonomous Driving Autonomous Vehicles | Unverified
Constrained Reinforcement Learning for Dexterous Manipulation | Jan 24, 2023 | reinforcement-learning Reinforcement Learning | Code Available
Intrinsic Motivation in Model-based Reinforcement Learning: A Brief Review | Jan 24, 2023 | Model-based Reinforcement Learning reinforcement-learning | Unverified
AutoCost: Evolving Intrinsic Cost for Zero-violation Reinforcement Learning | Jan 24, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified
A Novel Deep Reinforcement Learning-based Approach for Enhancing Spectral Efficiency of IRS-assisted Wireless Systems | Jan 24, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified
SMART: Self-supervised Multi-task pretrAining with contRol Transformers | Jan 24, 2023 | Decision Making Imitation Learning | Unverified
Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning | Jan 24, 2023 | Model-based Reinforcement Learning reinforcement-learning | Unverified
Story Shaping: Teaching Agents Human-like Behavior with Stories | Jan 24, 2023 | reinforcement-learning Reinforcement Learning | Unverified
Model Based Reinforcement Learning with Non-Gaussian Environment Dynamics and its Application to Portfolio Optimization | Jan 23, 2023 | Algorithmic Trading Decision Making | Unverified
Learning to View: Decision Transformers for Active Object Detection | Jan 23, 2023 | Active Object Detection Motion Planning | Unverified
Forecaster-aided User Association and Load Balancing in Multi-band Mobile Networks | Jan 23, 2023 | Model Predictive Control Reinforcement Learning (RL) | Unverified
Quasi-optimal Reinforcement Learning with Continuous Actions | Jan 21, 2023 | reinforcement-learning Reinforcement Learning | Unverified
The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning | Jan 21, 2023 | Lifelong learning reinforcement-learning | Code Available
Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning | Jan 20, 2023 | continuous-control Continuous Control | Unverified
Multi-Armed Bandits and Quantum Channel Oracles | Jan 20, 2023 | Multi-Armed Bandits reinforcement-learning | Unverified
Multi-agent Reinforcement Learning with Graph Q-Networks for Antenna Tuning | Jan 20, 2023 | Graph Neural Network Multi-agent Reinforcement Learning | Unverified
Modeling Moral Choices in Social Dilemmas with Multi-Agent Reinforcement Learning | Jan 20, 2023 | Multi-agent Reinforcement Learning reinforcement-learning | Code Available
Reinforcement learning-based estimation for partial differential equations | Jan 20, 2023 | reinforcement-learning Reinforcement Learning | Unverified
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets | Jan 20, 2023 | Deep Reinforcement Learning Management | Unverified
Generative Slate Recommendation with Reinforcement Learning | Jan 20, 2023 | Recommendation Systems reinforcement-learning | Unverified
Domain-adapted Learning and Imitation: DRL for Power Arbitrage | Jan 19, 2023 | Imitation Learning reinforcement-learning | Unverified
A Survey of Meta-Reinforcement Learning | Jan 19, 2023 | Deep Reinforcement Learning Meta Reinforcement Learning | Unverified
Generalization through Diversity: Improving Unsupervised Environment Design | Jan 19, 2023 | Decision Making Diversity | Unverified
Advanced Scaling Methods for VNF deployment with Reinforcement Learning | Jan 19, 2023 | reinforcement-learning Reinforcement Learning | Unverified
Domain-adapted Learning and Interpretability: DRL for Gas Trading | Jan 19, 2023 | Deep Reinforcement Learning Ensemble Learning | Unverified
Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient | Jan 19, 2023 | Decision Making reinforcement-learning | Unverified