Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization Mar 28, 2024 Compiler Optimization Imitation Learning
— Unverified 0Reinforcement Learning in Agent-Based Market Simulation: Unveiling Realistic Stylized Facts and Behavior Mar 28, 2024 Reinforcement Learning (RL)
— Unverified 0LORD: Large Models based Opposite Reward Design for Autonomous Driving Mar 27, 2024 Autonomous Driving Imitation Learning
— Unverified 0Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning Mar 27, 2024 Classification image-classification
— Unverified 0From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries Mar 27, 2024 Autonomous Navigation Decision Making
Code Code Available 0Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving Mar 27, 2024 Autonomous Driving Decision Making
— Unverified 0Image Deraining via Self-supervised Reinforcement Learning Mar 27, 2024 Denoising Dictionary Learning
— Unverified 0Probabilistic Model Checking of Stochastic Reinforcement Learning Policies Mar 27, 2024 model reinforcement-learning
— Unverified 0Towards Human-Centered Construction Robotics: A Reinforcement Learning-Driven Companion Robot for Contextually Assisting Carpentry Workers Mar 27, 2024 Reinforcement Learning (RL)
— Unverified 0FPGA-Based Neural Thrust Controller for UAVs Mar 27, 2024 Reinforcement Learning (RL)
— Unverified 0CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning Mar 27, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Safe and Robust Reinforcement Learning: Principles and Practice Mar 27, 2024 Domain Adaptation reinforcement-learning
— Unverified 0TractOracle: towards an anatomically-informed reward function for RL-based tractography Mar 26, 2024 Reinforcement Learning (RL)
Code Code Available 1Learning the Optimal Power Flow: Environment Design Matters Mar 26, 2024 Reinforcement Learning (RL)
Code Code Available 0PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning Mar 26, 2024 Deep Reinforcement Learning Distributed Computing
Code Code Available 1Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems Mar 26, 2024 Bilevel Optimization Model Predictive Control
Code Code Available 1Uncertainty-aware Distributional Offline Reinforcement Learning Mar 26, 2024 Offline RL reinforcement-learning
— Unverified 0Depending on yourself when you should: Mentoring LLM with RL agents to become the master in cybersecurity games Mar 26, 2024 Reinforcement Learning (RL)
— Unverified 0RL for Consistency Models: Faster Reward Guided Text-to-Image Generation Mar 25, 2024 Image Generation Instruction Following
— Unverified 0Policy Optimization finds Nash Equilibrium in Regularized General-Sum LQ Games Mar 25, 2024 Reinforcement Learning (RL)
— Unverified 0Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints Mar 25, 2024 Q-Learning Reinforcement Learning (RL)
— Unverified 0Outcome-Constrained Large Language Models for Countering Hate Speech Mar 25, 2024 Reinforcement Learning (RL) Text Generation
— Unverified 0Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling Mar 25, 2024 Offline RL Recommendation Systems
— Unverified 0Planning with a Learned Policy Basis to Optimally Solve Complex Tasks Mar 22, 2024 Reinforcement Learning (RL)
— Unverified 0Distilling Reinforcement Learning Policies for Interpretable Robot Locomotion: Gradient Boosting Machines and Symbolic Regression Mar 21, 2024 Additive models Reinforcement Learning (RL)
— Unverified 0Task-optimal data-driven surrogate models for eNMPC via differentiable simulation and optimization Mar 21, 2024 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method Mar 21, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Constrained Reinforcement Learning with Smoothed Log Barrier Function Mar 21, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Policy Mirror Descent with Lookahead Mar 21, 2024 Reinforcement Learning (RL)
Code Code Available 0Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning Mar 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network Mar 20, 2024 Management Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study Mar 20, 2024 Autonomous Driving Q-Learning
— Unverified 0Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes Mar 19, 2024 Reinforcement Learning (RL)
— Unverified 0Policy Bifurcation in Safe Reinforcement Learning Mar 19, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning Mar 19, 2024 Reinforcement Learning (RL) Visual Grounding
Code Code Available 1Equivariant Ensembles and Regularization for Reinforcement Learning in Map-based Path Planning Mar 19, 2024 Inductive Bias Reinforcement Learning (RL)
Code Code Available 2Fast Value Tracking for Deep Reinforcement Learning Mar 19, 2024 Decision Making Deep Reinforcement Learning
— Unverified 0Reinforcement Learning with Generalizable Gaussian Splatting Mar 18, 2024 3DGS reinforcement-learning
— Unverified 0Efficient Transformer-based Hyper-parameter Optimization for Resource-constrained IoT Environments Mar 18, 2024 Reinforcement Learning (RL)
Code Code Available 0Distill2Explain: Differentiable decision trees for explainable reinforcement learning in energy application controllers Mar 18, 2024 energy management Reinforcement Learning (RL)
— Unverified 0Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning Mar 18, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight Mar 18, 2024 Imitation Learning reinforcement-learning
— Unverified 0EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents Mar 18, 2024 Reinforcement Learning (RL) World Knowledge
— Unverified 0Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data Mar 18, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Offline Multitask Representation Learning for Reinforcement Learning Mar 18, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards Mar 18, 2024 Decision Making Q-Learning
— Unverified 0The Value of Reward Lookahead in Reinforcement Learning Mar 18, 2024 Offline RL reinforcement-learning
— Unverified 0Reinforcement Learning with Token-level Feedback for Controllable Text Generation Mar 18, 2024 Attribute reinforcement-learning
Code Code Available 1Causality from Bottom to Top: A Survey Mar 17, 2024 Anomaly Detection Fraud Detection
— Unverified 0Prior-dependent analysis of posterior sampling reinforcement learning with function approximation Mar 17, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0