AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models | Sep 13, 2024 | Reinforcement Learning (RL) | Code Available | 1
Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning | Sep 12, 2024 | Decision Making, Management | Unverified | 0
Digital Twin for Autonomous Guided Vehicles based on Integrated Sensing and Communications | Sep 12, 2024 | ISAC, Reinforcement Learning (RL) | Unverified | 0
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning | Sep 12, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Hand-Object Interaction Pretraining from Videos | Sep 12, 2024 | Object, Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning Discovers Efficient Decentralized Graph Path Search Strategies | Sep 12, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0
The Role of Deep Learning Regularizations on Actors in Offline RL | Sep 11, 2024 | D4RL, Offline RL | Code Available | 0
Learning Efficient Recursive Numeral Systems via Reinforcement Learning | Sep 11, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence | Sep 11, 2024 | Reinforcement Learning (RL) | Unverified | 0
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning | Sep 10, 2024 | Deep Reinforcement Learning, OpenAI Gym | Code Available | 0
Automated Data Augmentation for Few-Shot Time Series Forecasting: A Reinforcement Learning Approach Guided by a Model Zoo | Sep 10, 2024 | Data Augmentation, Diversity | Unverified | 0
Superior Computer Chess with Model Predictive Control, Reinforcement Learning, and Rollout | Sep 10, 2024 | Model Predictive Control, Position | Unverified | 0
BetterBodies: Reinforcement Learning guided Diffusion for Antibody Sequence Design | Sep 9, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Markov Chain Variance Estimation: A Stochastic Approximation Approach | Sep 9, 2024 | Reinforcement Learning (RL) | Unverified | 0
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Sep 9, 2024 | D4RL, Decision Making | Unverified | 0
BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping | Sep 9, 2024 | Reinforcement Learning (RL) | Unverified | 0
Semifactual Explanations for Reinforcement Learning | Sep 9, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0
An Introduction to Quantum Reinforcement Learning (QRL) | Sep 9, 2024 | Decision Making, reinforcement-learning | Unverified | 0
Causality-Driven Reinforcement Learning for Joint Communication and Sensing | Sep 7, 2024 | Autonomous Driving, Causal Discovery | Unverified | 0
Reinforcement Learning for Rate Maximization in IRS-aided OWC Networks | Sep 7, 2024 | Q-Learning, reinforcement-learning | Unverified | 0
Reward-Directed Score-Based Diffusion Models via q-Learning | Sep 7, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning-Based Adaptive Load Balancing for Dynamic Cloud Environments | Sep 7, 2024 | Cloud Computing, reinforcement-learning | Unverified | 0
Sample and Oracle Efficient Reinforcement Learning for MDPs with Linearly-Realizable Value Functions | Sep 7, 2024 | Reinforcement Learning (RL) | Unverified | 0
Reward Guidance for Reinforcement Learning Tasks Based on Large Language Models: The LMGT Framework | Sep 7, 2024 | Language Modeling, Language Modelling | Unverified | 0
Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn | Sep 7, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization | Sep 6, 2024 | Reinforcement Learning (RL), Riemannian optimization | Unverified | 0
InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | Sep 5, 2024 | Benchmarking, Computational Efficiency | Unverified | 0
Differentiable Discrete Event Simulation for Queuing Network Control | Sep 5, 2024 | GPU, Reinforcement Learning (RL) | Unverified | 0
CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning | Sep 5, 2024 | Lifelong learning, reinforcement-learning | Unverified | 0
Robust synchronization and policy adaptation for networked heterogeneous agents | Sep 5, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Reinforcement Learning Approach to Optimizing Profilometric Sensor Trajectories for Surface Inspection | Sep 5, 2024 | Defect Detection, Reinforcement Learning (RL) | Unverified | 0
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models | Sep 5, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Enhancing Information Freshness: An AoI Optimized Markov Decision Process Dedicated In the Underwater Task | Sep 4, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0
Continual Diffuser (CoD): Mastering Continual Offline Reinforcement Learning with Experience Rehearsal | Sep 4, 2024 | Reinforcement Learning (RL) | Code Available | 0
Large Language Models as Efficient Reward Function Searchers for Custom-Environment Multi-Objective Reinforcement Learning | Sep 4, 2024 | Long-Context Understanding, Multi-Objective Reinforcement Learning | Unverified | 0
Tractable Offline Learning of Regular Decision Processes | Sep 4, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0
State and Action Factorization in Power Grids | Sep 3, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Reinforcement Learning-enabled Satellite Constellation Reconfiguration and Retasking for Mission-Critical Applications | Sep 3, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0
Grounding Language Models in Autonomous Loco-manipulation Tasks | Sep 2, 2024 | Language Modeling, Language Modelling | Unverified | 0
MOOSS: Mask-Enhanced Temporal Contrastive Learning for Smooth State Evolution in Visual Reinforcement Learning | Sep 2, 2024 | Contrastive Learning, graph construction | Code Available | 0
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Sep 2, 2024 | Diversity, Offline RL | Code Available | 2
Diffusion Policy Policy Optimization | Sep 1, 2024 | continuous-control, Continuous Control | Code Available | 4
AgGym: An agricultural biotic stress simulation environment for ultra-precision management planning | Sep 1, 2024 | Deep Reinforcement Learning, Management | Code Available | 0
Foundations of Multivariate Distributional Reinforcement Learning | Aug 31, 2024 | Decision Making, Distributional Reinforcement Learning | Unverified | 0
Robust off-policy Reinforcement Learning via Soft Constrained Adversary | Aug 31, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control | Aug 30, 2024 | Model-based Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1
Discovery of False Data Injection Schemes on Frequency Controllers with Reinforcement Learning | Aug 30, 2024 | Reinforcement Learning (RL) | Unverified | 0
AdapShare: An RL-Based Dynamic Spectrum Sharing Solution for O-RAN | Aug 29, 2024 | Fairness, Reinforcement Learning (RL) | Unverified | 0
On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes | Aug 29, 2024 | Q-Learning, Reinforcement Learning (RL) | Unverified | 0
RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models | Aug 28, 2024 | Reinforcement Learning (RL) | Code Available | 0