PickLLM: Context-Aware RL-Assisted Large Language Model Routing Dec 12, 2024 Language Modeling Language Modelling
— Unverified 0Ask1: Development and Reinforcement Learning-Based Control of a Custom Quadruped Robot Dec 11, 2024 Reinforcement Learning (RL)
— Unverified 0Coarse-to-Fine: A Dual-Phase Channel-Adaptive Method for Wireless Image Transmission Dec 11, 2024 Reinforcement Learning (RL)
— Unverified 0Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning Dec 11, 2024 Autonomous Driving Offline RL
Code Code Available 0Progressive-Resolution Policy Distillation: Leveraging Coarse-Resolution Simulations for Time-Efficient Fine-Resolution Policy Learning Dec 10, 2024 Reinforcement Learning (RL)
— Unverified 0Optimizing Sensor Redundancy in Sequential Decision-Making Problems Dec 10, 2024 Decision Making OpenAI Gym
— Unverified 0Mobile-TeleVision: Predictive Motion Priors for Humanoid Whole-Body Control Dec 10, 2024 motion retargeting Reinforcement Learning (RL)
— Unverified 0Swarm Behavior Cloning Dec 10, 2024 Decision Making Imitation Learning
— Unverified 0Preference Adaptive and Sequential Text-to-Image Generation Dec 10, 2024 Image Generation Language Modeling
— Unverified 0Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone Dec 9, 2024 global-optimization Imitation Learning
— Unverified 0Skill-Enhanced Reinforcement Learning Acceleration from Demonstrations Dec 9, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Dec 9, 2024 Reinforcement Learning (RL)
— Unverified 0Mean--Variance Portfolio Selection by Continuous-Time Reinforcement Learning: Algorithms, Regret Analysis, and Empirical Study Dec 8, 2024 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for a Discrete-Time Linear-Quadratic Control Problem with an Application Dec 8, 2024 Management Reinforcement Learning (RL)
— Unverified 0Learning Soft Driving Constraints from Vectorized Scene Embeddings while Imitating Expert Trajectories Dec 7, 2024 Imitation Learning Motion Planning
— Unverified 0AI Planning: A Primer and Survey (Preliminary Report) Dec 7, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0RLZero: Direct Policy Inference from Language Without In-Domain Supervision Dec 7, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0ELEMENT: Episodic and Lifelong Exploration via Maximum Entropy Dec 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting Dec 5, 2024 D4RL Offline RL
— Unverified 0Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy Dec 5, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Traffic Co-Simulation Framework Empowered by Infrastructure Camera Sensing and Reinforcement Learning Dec 5, 2024 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning Dec 4, 2024 Efficient Exploration reinforcement-learning
— Unverified 0Using Deep Reinforcement Learning to Enhance Channel Sampling Patterns in Integrated Sensing and Communication Dec 4, 2024 Deep Reinforcement Learning Integrated sensing and communication
— Unverified 0Learning Whole-Body Loco-Manipulation for Omni-Directional Task Space Pose Tracking with a Wheeled-Quadrupedal-Manipulator Dec 4, 2024 Pose Tracking Reinforcement Learning (RL)
— Unverified 0Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning Dec 4, 2024 D4RL Imitation Learning
Code Code Available 0Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents Dec 3, 2024 Out-of-Distribution Detection Reinforcement Learning (RL)
— Unverified 0AI-Driven Resource Allocation Framework for Microservices in Hybrid Cloud Platforms Dec 3, 2024 Management reinforcement-learning
— Unverified 0Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum Dec 3, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Generating Critical Scenarios for Testing Automated Driving Systems Dec 3, 2024 Autonomous Driving Autonomous Vehicles
— Unverified 0Reinforcement learning to learn quantum states for Heisenberg scaling accuracy Dec 3, 2024 Meta-Learning Quantum Machine Learning
Code Code Available 0Selective Reviews of Bandit Problems in AI via a Statistical View Dec 3, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 0A Memory-Based Reinforcement Learning Approach to Integrated Sensing and Communication Dec 2, 2024 Deep Reinforcement Learning Integrated sensing and communication
— Unverified 0Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations Dec 2, 2024 continuous-control Continuous Control
— Unverified 0Explore Reinforced: Equilibrium Approximation with Reinforcement Learning Dec 2, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Approximately Optimal Search on a Higher-dimensional Sliding Puzzle Dec 2, 2024 Reinforcement Learning (RL)
Code Code Available 0RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks Dec 2, 2024 energy management In-Context Learning
— Unverified 0Provable Partially Observable Reinforcement Learning with Privileged Information Dec 1, 2024 Partially Observable Reinforcement Learning reinforcement-learning
— Unverified 0Bilinear Convolution Decomposition for Causal RL Interpretability Dec 1, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0BOTS: Batch Bayesian Optimization of Extended Thompson Sampling for Severely Episode-Limited RL Settings Nov 30, 2024 Bayesian Optimization Policy Gradient Methods
— Unverified 0HVAC-DPT: A Decision Pretrained Transformer for HVAC Control Nov 29, 2024 In-Context Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Solving Rubik's Cube Without Tricky Sampling Nov 29, 2024 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0RL-MILP Solver: A Reinforcement Learning Approach for Solving Mixed-Integer Linear Programs with Graph Neural Networks Nov 29, 2024 Graph Neural Network Reinforcement Learning (RL)
— Unverified 0TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning Nov 28, 2024 Reinforcement Learning (RL)
— Unverified 0Convex Regularization and Convergence of Policy Gradient Flows under Safety Constraints Nov 28, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges Nov 28, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed Nov 28, 2024 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 0NeoHebbian Synapses to Accelerate Online Training of Neuromorphic Hardware Nov 27, 2024 Reinforcement Learning (RL)
— Unverified 0Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning Nov 27, 2024 Model Predictive Control reinforcement-learning
— Unverified 0ScaleViz: Scaling Visualization Recommendation Models on Large Data Nov 27, 2024 Reinforcement Learning (RL)
— Unverified 0ELEMENTAL: Interactive Learning from Demonstrations and Vision-Language Models for Reward Design in Robotics Nov 27, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0