Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment Mar 28, 2024 Reinforcement Learning (RL)
— Unverified 0Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization Mar 28, 2024 Compiler Optimization Imitation Learning
— Unverified 0Reinforcement Learning in Agent-Based Market Simulation: Unveiling Realistic Stylized Facts and Behavior Mar 28, 2024 Reinforcement Learning (RL)
— Unverified 0Probabilistic Model Checking of Stochastic Reinforcement Learning Policies Mar 27, 2024 model reinforcement-learning
— Unverified 0Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning Mar 27, 2024 Classification image-classification
— Unverified 0Towards Human-Centered Construction Robotics: A Reinforcement Learning-Driven Companion Robot for Contextually Assisting Carpentry Workers Mar 27, 2024 Reinforcement Learning (RL)
— Unverified 0Safe and Robust Reinforcement Learning: Principles and Practice Mar 27, 2024 Domain Adaptation reinforcement-learning
— Unverified 0FPGA-Based Neural Thrust Controller for UAVs Mar 27, 2024 Reinforcement Learning (RL)
— Unverified 0Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving Mar 27, 2024 Autonomous Driving Decision Making
— Unverified 0Image Deraining via Self-supervised Reinforcement Learning Mar 27, 2024 Denoising Dictionary Learning
— Unverified 0CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning Mar 27, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries Mar 27, 2024 Autonomous Navigation Decision Making
Code Code Available 0LORD: Large Models based Opposite Reward Design for Autonomous Driving Mar 27, 2024 Autonomous Driving Imitation Learning
— Unverified 0Learning the Optimal Power Flow: Environment Design Matters Mar 26, 2024 Reinforcement Learning (RL)
Code Code Available 0Depending on yourself when you should: Mentoring LLM with RL agents to become the master in cybersecurity games Mar 26, 2024 Reinforcement Learning (RL)
— Unverified 0Uncertainty-aware Distributional Offline Reinforcement Learning Mar 26, 2024 Offline RL reinforcement-learning
— Unverified 0Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints Mar 25, 2024 Q-Learning Reinforcement Learning (RL)
— Unverified 0RL for Consistency Models: Faster Reward Guided Text-to-Image Generation Mar 25, 2024 Image Generation Instruction Following
— Unverified 0Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling Mar 25, 2024 Offline RL Recommendation Systems
— Unverified 0Outcome-Constrained Large Language Models for Countering Hate Speech Mar 25, 2024 Reinforcement Learning (RL) Text Generation
— Unverified 0Policy Optimization finds Nash Equilibrium in Regularized General-Sum LQ Games Mar 25, 2024 Reinforcement Learning (RL)
— Unverified 0Planning with a Learned Policy Basis to Optimally Solve Complex Tasks Mar 22, 2024 Reinforcement Learning (RL)
— Unverified 0Policy Mirror Descent with Lookahead Mar 21, 2024 Reinforcement Learning (RL)
Code Code Available 0Task-optimal data-driven surrogate models for eNMPC via differentiable simulation and optimization Mar 21, 2024 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0Constrained Reinforcement Learning with Smoothed Log Barrier Function Mar 21, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method Mar 21, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Distilling Reinforcement Learning Policies for Interpretable Robot Locomotion: Gradient Boosting Machines and Symbolic Regression Mar 21, 2024 Additive models Reinforcement Learning (RL)
— Unverified 0Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning Mar 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Reinforcement Learning for Online Testing of Autonomous Driving Systems: a Replication and Extension Study Mar 20, 2024 Autonomous Driving Q-Learning
— Unverified 0Safety-Aware Reinforcement Learning for Electric Vehicle Charging Station Management in Distribution Network Mar 20, 2024 Management Reinforcement Learning (RL)
— Unverified 0Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes Mar 19, 2024 Reinforcement Learning (RL)
— Unverified 0Fast Value Tracking for Deep Reinforcement Learning Mar 19, 2024 Decision Making Deep Reinforcement Learning
— Unverified 0Efficient Transformer-based Hyper-parameter Optimization for Resource-constrained IoT Environments Mar 18, 2024 Reinforcement Learning (RL)
Code Code Available 0Bootstrapping Reinforcement Learning with Imitation for Vision-Based Agile Flight Mar 18, 2024 Imitation Learning reinforcement-learning
— Unverified 0Distill2Explain: Differentiable decision trees for explainable reinforcement learning in energy application controllers Mar 18, 2024 energy management Reinforcement Learning (RL)
— Unverified 0EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents Mar 18, 2024 Reinforcement Learning (RL) World Knowledge
— Unverified 0Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning Mar 18, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0The Value of Reward Lookahead in Reinforcement Learning Mar 18, 2024 Offline RL reinforcement-learning
— Unverified 0State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards Mar 18, 2024 Decision Making Q-Learning
— Unverified 0Offline Multitask Representation Learning for Reinforcement Learning Mar 18, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning with Generalizable Gaussian Splatting Mar 18, 2024 3DGS reinforcement-learning
— Unverified 0Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data Mar 18, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Prior-dependent analysis of posterior sampling reinforcement learning with function approximation Mar 17, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Independent RL for Cooperative-Competitive Agents: A Mean-Field Perspective Mar 17, 2024 Problem Decomposition Reinforcement Learning (RL)
— Unverified 0Causality from Bottom to Top: A Survey Mar 17, 2024 Anomaly Detection Fraud Detection
— Unverified 0Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC Mar 16, 2024 Decision Making Edge-computing
— Unverified 0Neural-Kernel Conditional Mean Embeddings Mar 16, 2024 Deep Learning Density Estimation
— Unverified 0The Fallacy of Minimizing Cumulative Regret in the Sequential Task Setting Mar 16, 2024 Reinforcement Learning (RL)
— Unverified 0ViSaRL: Visual Reinforcement Learning Guided by Human Saliency Mar 16, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning Mar 15, 2024 Natural Language Understanding reinforcement-learning
Code Code Available 0