Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts Apr 14, 2024 Language Modeling Language Modelling
— Unverified 0SmartPathfinder: Pushing the Limits of Heuristic Solutions for Vehicle Routing Problem with Drones Using Reinforcement Learning Apr 13, 2024 Reinforcement Learning (RL)
— Unverified 0WROOM: An Autonomous Driving Approach for Off-Road Navigation Apr 12, 2024 Autonomous Driving Reinforcement Learning (RL)
Code Code Available 1Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation Apr 12, 2024 Language Modeling Language Modelling
— Unverified 0Dataset Reset Policy Optimization for RLHF Apr 12, 2024 Reinforcement Learning (RL)
Code Code Available 1Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning Apr 12, 2024 Computational Efficiency Hyperparameter Optimization
Code Code Available 0FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning Apr 11, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Enhancing Policy Gradient with the Polyak Step-Size Adaption Apr 11, 2024 Reinforcement Learning (RL) Sensitivity
— Unverified 0Efficient Duple Perturbation Robustness in Low-rank MDPs Apr 11, 2024 Reinforcement Learning (RL)
— Unverified 0Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains Apr 11, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0On the Sample Efficiency of Abstractions and Potential-Based Reward Shaping in Reinforcement Learning Apr 11, 2024 Reinforcement Learning (RL)
— Unverified 0Dual Ensemble Kalman Filter for Stochastic Optimal Control Apr 10, 2024 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0UAV-Assisted Enhanced Coverage and Capacity in Dynamic MU-mMIMO IoT Systems: A Deep Reinforcement Learning Approach Apr 10, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery Apr 10, 2024 Decision Making Imitation Learning
— Unverified 0Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection Apr 10, 2024 Out-of-Distribution Detection Out of Distribution (OOD) Detection
Code Code Available 0How Consistent are Clinicians? Evaluating the Predictability of Sepsis Disease Progression with Dynamics Models Apr 10, 2024 Diversity Reinforcement Learning (RL)
Code Code Available 1Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis Apr 9, 2024 MuJoCo Reinforcement Learning (RL)
— Unverified 0Adaptable Recovery Behaviors in Robotics: A Behavior Trees and Motion Generators(BTMG) Approach for Failure Management Apr 9, 2024 Management Reinforcement Learning (RL)
— Unverified 0Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning Apr 9, 2024 Diversity Reinforcement Learning (RL)
— Unverified 0Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer Apr 8, 2024 MuJoCo Physical Simulations
Code Code Available 5FGAIF: Aligning Large Vision-Language Models with Fine-grained AI Feedback Apr 7, 2024 Attribute Hallucination
— Unverified 0Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning Apr 6, 2024 D4RL Offline RL
Code Code Available 0Transform then Explore: a Simple and Effective Technique for Exploratory Combinatorial Optimization with Reinforcement Learning Apr 6, 2024 Combinatorial Optimization Feature Engineering
— Unverified 0Enhancing IoT Intelligence: A Transformer-based Reinforcement Learning Methodology Apr 5, 2024 Decision Making Navigate
— Unverified 0Continual Policy Distillation of Reinforcement Learning-based Controllers for Soft Robotic In-Hand Manipulation Apr 5, 2024 Reinforcement Learning (RL)
Code Code Available 0A Reinforcement Learning based Reset Policy for CDCL SAT Solvers Apr 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Exploration is Harder than Prediction: Cryptographically Separating Reinforcement Learning from Supervised Learning Apr 4, 2024 regression Reinforcement Learning (RL)
— Unverified 0Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention Apr 4, 2024 Contrastive Learning Multi-Task Learning
Code Code Available 0Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm Apr 4, 2024 Reinforcement Learning (RL)
— Unverified 0REACT: Revealing Evolutionary Action Consequence Trajectories for Interpretable Reinforcement Learning Apr 4, 2024 Descriptive Diversity
— Unverified 0SliceIt! -- A Dual Simulator Framework for Learning Robot Food Slicing Apr 3, 2024 Reinforcement Learning (RL)
Code Code Available 0Methodology for Interpretable Reinforcement Learning for Optimizing Mechanical Ventilation Apr 3, 2024 Off-policy evaluation reinforcement-learning
— Unverified 0Electric Vehicle Routing Problem for Emergency Power Supply: Towards Telecom Base Station Relief Apr 3, 2024 Reinforcement Learning (RL)
Code Code Available 1Reinforcement Learning in Categorical Cybernetics Apr 3, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning Apr 2, 2024 All Deep Reinforcement Learning
— Unverified 0Asymptotics of Language Model Alignment Apr 2, 2024 Language Modeling Language Modelling
— Unverified 0Emergence of Chemotactic Strategies with Multi-Agent Reinforcement Learning Apr 2, 2024 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0EV2Gym: A Flexible V2G Simulator for EV Smart Charging Research and Benchmarking Apr 2, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 2Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation Apr 2, 2024 Active Learning Bayesian Inference
— Unverified 0Entity-Centric Reinforcement Learning for Object Manipulation from Pixels Apr 1, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1MTLight: Efficient Multi-Task Reinforcement Learning for Traffic Signal Control Apr 1, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning Mar 31, 2024 Atari Games Q-Learning
— Unverified 0Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration Mar 31, 2024 continuous-control Continuous Control
— Unverified 0Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods Mar 30, 2024 Autonomous Driving Language Modeling
— Unverified 0Molecular Generative Adversarial Network with Multi-Property Optimization Mar 29, 2024 Drug Discovery Generative Adversarial Network
— Unverified 0Learning Visual Quadrupedal Loco-Manipulation from Demonstrations Mar 29, 2024 Reinforcement Learning (RL)
— Unverified 0CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning Mar 29, 2024 counterfactual Offline RL
— Unverified 0Nonparametric Bellman Mappings for Reinforcement Learning: Application to Robust Adaptive Filtering Mar 29, 2024 Dimensionality Reduction Reinforcement Learning (RL)
— Unverified 0Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment Mar 28, 2024 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning in Agent-Based Market Simulation: Unveiling Realistic Stylized Facts and Behavior Mar 28, 2024 Reinforcement Learning (RL)
— Unverified 0