Learning Soft Driving Constraints from Vectorized Scene Embeddings while Imitating Expert Trajectories Dec 7, 2024 Imitation Learning Motion Planning
— Unverified 0RLZero: Direct Policy Inference from Language Without In-Domain Supervision Dec 7, 2024 Imitation Learning Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning Enhanced LLMs: A Survey Dec 5, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 3Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning Dec 5, 2024 Large Language Model Meta Reinforcement Learning
Code Code Available 1Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy Dec 5, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting Dec 5, 2024 D4RL Offline RL
— Unverified 0ELEMENT: Episodic and Lifelong Exploration via Maximum Entropy Dec 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Traffic Co-Simulation Framework Empowered by Infrastructure Camera Sensing and Reinforcement Learning Dec 5, 2024 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Using Deep Reinforcement Learning to Enhance Channel Sampling Patterns in Integrated Sensing and Communication Dec 4, 2024 Deep Reinforcement Learning Integrated sensing and communication
— Unverified 0Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning Dec 4, 2024 Efficient Exploration reinforcement-learning
— Unverified 0AI-Driven Day-to-Day Route Choice Dec 4, 2024 Decision Making Reinforcement Learning (RL)
Code Code Available 1Learning Whole-Body Loco-Manipulation for Omni-Directional Task Space Pose Tracking with a Wheeled-Quadrupedal-Manipulator Dec 4, 2024 Pose Tracking Reinforcement Learning (RL)
— Unverified 0Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning Dec 4, 2024 D4RL Imitation Learning
Code Code Available 0Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents Dec 3, 2024 Out-of-Distribution Detection Reinforcement Learning (RL)
— Unverified 0Technical Report on Reinforcement Learning Control on the Lucas-Nülle Inverted Pendulum Dec 3, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0AI-Driven Resource Allocation Framework for Microservices in Hybrid Cloud Platforms Dec 3, 2024 Management reinforcement-learning
— Unverified 0Generating Critical Scenarios for Testing Automated Driving Systems Dec 3, 2024 Autonomous Driving Autonomous Vehicles
— Unverified 0Selective Reviews of Bandit Problems in AI via a Statistical View Dec 3, 2024 Decision Making Decision Making Under Uncertainty
— Unverified 0Conformal Symplectic Optimization for Stable Reinforcement Learning Dec 3, 2024 Atari Games Deep Reinforcement Learning
Code Code Available 2Reinforcement learning to learn quantum states for Heisenberg scaling accuracy Dec 3, 2024 Meta-Learning Quantum Machine Learning
Code Code Available 0A Memory-Based Reinforcement Learning Approach to Integrated Sensing and Communication Dec 2, 2024 Deep Reinforcement Learning Integrated sensing and communication
— Unverified 0Approximately Optimal Search on a Higher-dimensional Sliding Puzzle Dec 2, 2024 Reinforcement Learning (RL)
Code Code Available 0RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks Dec 2, 2024 energy management In-Context Learning
— Unverified 0Explore Reinforced: Equilibrium Approximation with Reinforcement Learning Dec 2, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective Dec 2, 2024 Density Estimation Offline RL
Code Code Available 2Dense Dynamics-Aware Reward Synthesis: Integrating Prior Experience with Demonstrations Dec 2, 2024 continuous-control Continuous Control
— Unverified 0Provable Partially Observable Reinforcement Learning with Privileged Information Dec 1, 2024 Partially Observable Reinforcement Learning reinforcement-learning
— Unverified 0Bilinear Convolution Decomposition for Causal RL Interpretability Dec 1, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0BOTS: Batch Bayesian Optimization of Extended Thompson Sampling for Severely Episode-Limited RL Settings Nov 30, 2024 Bayesian Optimization Policy Gradient Methods
— Unverified 0o1-Coder: an o1 Replication for Coding Nov 29, 2024 Reinforcement Learning (RL)
Code Code Available 3RL-MILP Solver: A Reinforcement Learning Approach for Solving Mixed-Integer Linear Programs with Graph Neural Networks Nov 29, 2024 Graph Neural Network Reinforcement Learning (RL)
— Unverified 0HVAC-DPT: A Decision Pretrained Transformer for HVAC Control Nov 29, 2024 In-Context Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Solving Rubik's Cube Without Tricky Sampling Nov 29, 2024 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Supervised Learning-enhanced Multi-Group Actor Critic for Live Stream Allocation in Feed Nov 28, 2024 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 0TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning Nov 28, 2024 Reinforcement Learning (RL)
— Unverified 0Convex Regularization and Convergence of Policy Gradient Flows under Safety Constraints Nov 28, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0A Comprehensive Survey of Reinforcement Learning: From Algorithms to Practical Challenges Nov 28, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning Nov 27, 2024 Model Predictive Control reinforcement-learning
— Unverified 0ELEMENTAL: Interactive Learning from Demonstrations and Vision-Language Models for Reward Design in Robotics Nov 27, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0ScaleViz: Scaling Visualization Recommendation Models on Large Data Nov 27, 2024 Reinforcement Learning (RL)
— Unverified 0NeoHebbian Synapses to Accelerate Online Training of Neuromorphic Hardware Nov 27, 2024 Reinforcement Learning (RL)
— Unverified 0Dynamic Retail Pricing via Q-Learning -- A Reinforcement Learning Framework for Enhanced Revenue Management Nov 27, 2024 Decision Making Management
— Unverified 0PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement Nov 26, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading Nov 26, 2024 Offline RL parameter-efficient fine-tuning
Code Code Available 2Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards Nov 26, 2024 Reinforcement Learning (RL)
— Unverified 0LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble Nov 26, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Free^2Guide: Gradient-Free Path Integral Control for Enhancing Text-to-Video Generation with Large Vision-Language Models Nov 26, 2024 Reinforcement Learning (RL) Text-to-Video Generation
— Unverified 0M3: Mamba-assisted Multi-Circuit Optimization via MBRL with Effective Scheduling Nov 25, 2024 Mamba Reinforcement Learning (RL)
— Unverified 0Probing for Consciousness in Machines Nov 25, 2024 Reinforcement Learning (RL)
— Unverified 0Unsupervised Event Outlier Detection in Continuous Time Nov 25, 2024 Anomaly Detection Data Augmentation
— Unverified 0