Asynchronous Fractional Multi-Agent Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing Sep 25, 2024 Deep Reinforcement Learning Edge-computing
— Unverified 0Spiders Based on Anxiety: How Reinforcement Learning Can Deliver Desired User Experience in Virtual Reality Personalized Arachnophobia Treatment Sep 25, 2024 Reinforcement Learning (RL)
Code Code Available 0Revisiting Space Mission Planning: A Reinforcement Learning-Guided Approach for Multi-Debris Rendezvous Sep 25, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Offline and Distributional Reinforcement Learning for Radio Resource Management Sep 25, 2024 Distributional Reinforcement Learning Management
— Unverified 0On-orbit Servicing for Spacecraft Collision Avoidance With Autonomous Decision Making Sep 25, 2024 Collision Avoidance Decision Making
— Unverified 0OffRIPP: Offline RL-based Informative Path Planning Sep 25, 2024 Offline RL reinforcement-learning
— Unverified 0Learning with Dynamics: Autonomous Regulation of UAV Based Communication Networks with Dynamic UAV Crew Sep 25, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Reinforcement Leaning for Infinite-Dimensional Systems Sep 24, 2024 Reinforcement Learning (RL)
— Unverified 0From Goal-Conditioned to Language-Conditioned Agents via Vision-Language Models Sep 24, 2024 Reinforcement Learning (RL) Zero-shot Generalization
— Unverified 0Whole-body End-Effector Pose Tracking Sep 24, 2024 Pose Tracking Reinforcement Learning (RL)
— Unverified 0Stage-Wise Reward Shaping for Acrobatic Robots: A Constrained Multi-Objective Reinforcement Learning Approach Sep 24, 2024 Multi-Objective Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 2Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm Sep 24, 2024 Offline RL Off-policy evaluation
— Unverified 0Energy Saving in 6G O-RAN Using DQN-based xApp Sep 23, 2024 Reinforcement Learning (RL)
— Unverified 0Intelligent Routing Algorithm over SDN: Reusable Reinforcement Learning Approach Sep 23, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0CANDERE-COACH: Reinforcement Learning from Noisy Feedback Sep 23, 2024 Imitation Learning reinforcement-learning
— Unverified 0Physics Enhanced Residual Policy Learning (PERPL) for safety cruising in mixed traffic platooning under actuator and communication delay Sep 23, 2024 Reinforcement Learning (RL)
— Unverified 0A novel agent with formal goal-reaching guarantees: an experimental study with a mobile robot Sep 23, 2024 Reinforcement Learning (RL)
— Unverified 0A Distribution-Aware Flow-Matching for Generating Unstructured Data for Few-Shot Reinforcement Learning Sep 21, 2024 Few-Shot Learning Reinforcement Learning (RL)
— Unverified 0OMG-RL:Offline Model-based Guided Reward Learning for Heparin Treatment Sep 20, 2024 Reinforcement Learning (RL)
— Unverified 0Scalable Multi-agent Reinforcement Learning for Factory-wide Dynamic Scheduling Sep 20, 2024 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience Sep 20, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0MAGICS: Adversarial RL with Minimax Actors Guided by Implicit Critic Stackelberg for Convergent Neural Synthesis of Robot Safety Sep 20, 2024 OpenAI Gym Reinforcement Learning (RL)
— Unverified 0Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning Sep 19, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning-based Model Predictive Control for Greenhouse Climate Control Sep 19, 2024 Model Predictive Control Prediction
Code Code Available 1Training Language Models to Self-Correct via Reinforcement Learning Sep 19, 2024 HumanEval Math
Code Code Available 2Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL Sep 19, 2024 Reinforcement Learning (RL)
— Unverified 0TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning Sep 19, 2024 Code Summarization Computational Efficiency
— Unverified 0The Central Role of the Loss Function in Reinforcement Learning Sep 19, 2024 Decision Making reinforcement-learning
— Unverified 0Reinforcement Learning as an Improvement Heuristic for Real-World Production Scheduling Sep 18, 2024 Reinforcement Learning (RL) Scheduling
— Unverified 0Data-Efficient Quadratic Q-Learning Using LMIs Sep 18, 2024 Q-Learning Reinforcement Learning (RL)
— Unverified 0IMRL: Integrating Visual, Physical, Temporal, and Geometric Representations for Enhanced Food Acquisition Sep 18, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0An Enhanced-State Reinforcement Learning Algorithm for Multi-Task Fusion in Large-Scale Recommender Systems Sep 18, 2024 Multi-Task Learning Recommendation Systems
— Unverified 0On-policy Actor-Critic Reinforcement Learning for Multi-UAV Exploration Sep 17, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0A Reinforcement Learning Environment for Automatic Code Optimization in the MLIR Compiler Sep 17, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Leveraging Symmetry to Accelerate Learning of Trajectory Tracking Controllers for Free-Flying Robotic Systems Sep 17, 2024 Reinforcement Learning (RL)
Code Code Available 1Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers Sep 16, 2024 Management Reinforcement Learning (RL)
— Unverified 0Logic Synthesis Optimization with Predictive Self-Supervision via Causal Transformers Sep 16, 2024 Reinforcement Learning (RL)
— Unverified 0Offline Reinforcement Learning for Learning to Dispatch for Job Shop Scheduling Sep 16, 2024 Combinatorial Optimization counterfactual
Code Code Available 0Instigating Cooperation among LLM Agents Using Adaptive Information Modulation Sep 16, 2024 Reinforcement Learning (RL)
— Unverified 0Enhancing RL Safety with Counterfactual LLM Reasoning Sep 16, 2024 counterfactual Language Modeling
Code Code Available 1Robust Reinforcement Learning with Dynamic Distortion Risk Measures Sep 16, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies Sep 16, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning Sep 16, 2024 Multi-Objective Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Mitigating Dimensionality in 2D Rectangle Packing Problem under Reinforcement Learning Schema Sep 15, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0KAN v.s. MLP for Offline Reinforcement Learning Sep 15, 2024 D4RL Kolmogorov-Arnold Networks
— Unverified 0PIP-Loco: A Proprioceptive Infinite Horizon Planning Framework for Quadrupedal Robot Locomotion Sep 14, 2024 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks Sep 13, 2024 Acrobot Reinforcement Learning (RL)
— Unverified 0Batch Ensemble for Variance Dependent Regret in Stochastic Bandits Sep 13, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks Sep 13, 2024 ARC Code Generation
— Unverified 0Quasimetric Value Functions with Dense Rewards Sep 13, 2024 continuous-control Continuous Control
— Unverified 0