B-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis Oct 4, 2023 Code Generation Deep Reinforcement Learning
— Unverified 0Multi-Agent Reinforcement Learning for Power Grid Topology Optimization Oct 4, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Oct 4, 2023 Decision Making Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own Oct 4, 2023 Quantization reinforcement-learning
— Unverified 0Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency Oct 3, 2023 Reinforcement Learning (RL)
Code Code Available 0Blending Imitation and Reinforcement Learning for Robust Policy Improvement Oct 3, 2023 Imitation Learning reinforcement-learning
— Unverified 0A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback Oct 3, 2023 Deep Reinforcement Learning Q-Learning
— Unverified 0AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model Oct 3, 2023 Attribute Reinforcement Learning (RL)
— Unverified 0On Representation Complexity of Model-based and Model-free Reinforcement Learning Oct 3, 2023 model MuJoCo
— Unverified 0Prioritized Soft Q-Decomposition for Lexicographic Reinforcement Learning Oct 3, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Navigating Uncertainty in ESG Investing Oct 3, 2023 Navigate Reinforcement Learning (RL)
— Unverified 0Towards a Unified Framework for Sequential Decision Making Oct 3, 2023 Bayesian Inference Decision Making
— Unverified 0Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning Oct 2, 2023 Offline RL reinforcement-learning
— Unverified 0REMEDI: REinforcement learning-driven adaptive MEtabolism modeling of primary sclerosing cholangitis DIsease progression Oct 2, 2023 Reinforcement Learning (RL)
— Unverified 0Improving Dialogue Management: Quality Datasets vs Models Oct 2, 2023 Dialog Learning Dialogue Management
Code Code Available 0From Bandits Model to Deep Deterministic Policy Gradient, Reinforcement Learning with Contextual Information Oct 1, 2023 Decision Making reinforcement-learning
— Unverified 0Controlling Neural Style Transfer with Deep Reinforcement Learning Sep 30, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Adaptive Control of an Inverted Pendulum by a Reinforcement Learning-based LQR Method Sep 30, 2023 Benchmarking Reinforcement Learning (RL)
— Unverified 0A Quantum States Preparation Method Based on Difference-Driven Reinforcement Learning Sep 29, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Learning to Rewrite Prompts for Personalized Text Generation Sep 29, 2023 Language Modelling Large Language Model
— Unverified 0Adversarial Driving Behavior Generation Incorporating Human Risk Cognition for Autonomous Vehicle Evaluation Sep 29, 2023 Reinforcement Learning (RL)
— Unverified 0ComSD: Balancing Behavioral Quality and Diversity in Unsupervised Skill Discovery Sep 29, 2023 Contrastive Learning Diversity
Code Code Available 0Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness Sep 29, 2023 Offline RL reinforcement-learning
Code Code Available 0Reinforcement Learning for Node Selection in Branch-and-Bound Sep 29, 2023 Graph Neural Network reinforcement-learning
— Unverified 0Uncertainty-Aware Decision Transformer for Stochastic Driving Environments Sep 28, 2023 Autonomous Driving Offline RL
— Unverified 0Robust Offline Reinforcement Learning -- Certify the Confidence Interval Sep 28, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Stackelberg Batch Policy Learning Sep 28, 2023 Decision Making Reinforcement Learning (RL)
— Unverified 0Efficiency Separation between RL Methods: Model-Free, Model-Based and Goal-Conditioned Sep 28, 2023 model Reinforcement Learning (RL)
— Unverified 0Raijū: Reinforcement Learning-Guided Post-Exploitation for Automating Security Assessment of Network Systems Sep 27, 2023 Reinforcement Learning (RL)
— Unverified 0PlotMap: Automated Layout Design for Building Game Worlds Sep 26, 2023 Decision Making Layout Design
Code Code Available 0Tempo Adaptation in Non-stationary Reinforcement Learning Sep 26, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0ODE-based Recurrent Model-free Reinforcement Learning for POMDPs Sep 25, 2023 continuous-control Continuous Control
— Unverified 0Tracking Control for a Spherical Pendulum via Curriculum Reinforcement Learning Sep 25, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds Sep 25, 2023 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0On the Effectiveness of Adversarial Samples against Ensemble Learning-based Windows PE Malware Detectors Sep 25, 2023 Ensemble Learning Malware Analysis
— Unverified 0A comparison of controller architectures and learning mechanisms for arbitrary robot morphologies Sep 25, 2023 Reinforcement Learning (RL)
— Unverified 0Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills Sep 24, 2023 Autonomous Driving Offline RL
— Unverified 0Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout Sep 24, 2023 Hierarchical Reinforcement Learning reinforcement-learning
Code Code Available 0Iterative Reachability Estimation for Safe Reinforcement Learning Sep 24, 2023 MuJoCo reinforcement-learning
— Unverified 0Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs Sep 23, 2023 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for Robust Header Compression under Model Uncertainty Sep 23, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Offline to Online Learning for Real-Time Bandwidth Estimation Sep 23, 2023 Imitation Learning Reinforcement Learning (RL)
— Unverified 0Robotic Offline RL from Internet Videos via Value-Function Pre-Training Sep 22, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps Sep 22, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0Curriculum Reinforcement Learning via Morphology-Environment Co-Evolution Sep 21, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Delays in Reinforcement Learning Sep 20, 2023 Decision Making reinforcement-learning
— Unverified 0Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces Sep 19, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Physics-Informed Machine Learning for Data Anomaly Detection, Classification, Localization, and Mitigation: A Review, Challenges, and Path Forward Sep 19, 2023 Anomaly Detection Physics-informed machine learning
— Unverified 0Mechanic Maker 2.0: Reinforcement Learning for Evaluating Generated Rules Sep 18, 2023 Game Design reinforcement-learning
— Unverified 0Privileged to Predicted: Towards Sensorimotor Reinforcement Learning for Urban Driving Sep 18, 2023 Autonomous Driving Imitation Learning
— Unverified 0