Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments | May 24, 2024 | Data Augmentation, Reinforcement Learning (RL) | Unverified | 0
Extracting Heuristics from Large Language Models for Reward Shaping in Reinforcement Learning | May 24, 2024 | Language Modelling, Large Language Model | Unverified | 0
Cross-Domain Policy Adaptation by Capturing Representation Mismatch | May 24, 2024 | Reinforcement Learning (RL), Representation Learning | Code Available | 1
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search | May 24, 2024 | Code Generation, Language Modelling | Code Available | 1
Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural Networks | May 23, 2024 | counterfactual, Counterfactual Inference | Unverified | 0
Efficiently Training Deep-Learning Parametric Policies using Lagrangian Duality | May 23, 2024 | Decision Making, Decision Making Under Uncertainty | Unverified | 0
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning | May 23, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 1
Multi-turn Reinforcement Learning from Preference Human Feedback | May 23, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 1
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence | May 23, 2024 | Distributional Reinforcement Learning, Policy Gradient Methods | Unverified | 0
Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | May 23, 2024 | continuous-control, Continuous Control | Code Available | 0
AGILE: A Novel Reinforcement Learning Framework of LLM Agents | May 23, 2024 | Question Answering, reinforcement-learning | Code Available | 2
A finite time analysis of distributed Q-learning | May 23, 2024 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0
Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RL, Q-Learning | Unverified | 0
Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences | May 23, 2024 | Reinforcement Learning (RL) | Code Available | 0
Variational Delayed Policy Optimization | May 23, 2024 | MuJoCo, Reinforcement Learning (RL) | Code Available | 0
Leader Reward for POMO-Based Neural Combinatorial Optimization | May 22, 2024 | Combinatorial Optimization, Reinforcement Learning (RL) | Unverified | 0
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention | May 22, 2024 | Autonomous Driving, Autonomous Vehicles | Unverified | 0
Lusifer: LLM-based User SImulated Feedback Environment for online Recommender systems | May 22, 2024 | Collaborative Filtering, Recommendation Systems | Code Available | 0
Learning to sample fibers for goodness-of-fit testing | May 22, 2024 | Reinforcement Learning (RL) | Unverified | 0
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | May 22, 2024 | Ingenuity, MuJoCo | Code Available | 1
Large Language Models (LLMs) Assisted Wireless Network Deployment in Urban Settings | May 22, 2024 | Navigate, Reinforcement Learning (RL) | Unverified | 0
Knowledge Graph Reasoning with Self-supervised Reinforcement Learning | May 22, 2024 | Knowledge Graphs, reinforcement-learning | Code Available | 1
HighwayLLM: Decision-Making and Navigation in Highway Driving with RL-Informed Language Model | May 22, 2024 | Autonomous Driving, Autonomous Vehicles | Unverified | 0
Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing | May 21, 2024 | Multi-agent Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Practical and efficient quantum circuit synthesis and transpiling with Reinforcement Learning | May 21, 2024 | Reinforcement Learning (RL) | Unverified | 0
CausalPlayground: Addressing Data-Generation Requirements in Cutting-Edge Causality Research | May 21, 2024 | Reinforcement Learning (RL) | Code Available | 1
A Multimodal Learning-based Approach for Autonomous Landing of UAV | May 21, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0
Rethinking Robustness Assessment: Adversarial Attacks on Learning-based Quadrupedal Locomotion Controllers | May 21, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Investigating the Impact of Choice on Deep Reinforcement Learning for Space Controls | May 20, 2024 | continuous-control, Continuous Control | Unverified | 0
Feasibility Consistent Representation Learning for Safe Reinforcement Learning | May 20, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 1
Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning | May 20, 2024 | Meta-Learning, Meta Reinforcement Learning | Code Available | 0
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning | May 20, 2024 | continuous-control, Continuous Control | Unverified | 0
Highway Graph to Accelerate Reinforcement Learning | May 20, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Comparisons Are All You Need for Optimizing Smooth Functions | May 19, 2024 | All, reinforcement-learning | Unverified | 0
Large Language Models are Biased Reinforcement Learners | May 19, 2024 | Decision Making, In-Context Learning | Code Available | 0
Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning | May 19, 2024 | counterfactual, Friction | Unverified | 0
Optimal control barrier functions for RL based safe powertrain control | May 18, 2024 | Reinforcement Learning (RL) | Unverified | 0
Combined film and pulse heating of lithium ion batteries to improve performance in low ambient temperature | May 18, 2024 | Reinforcement Learning (RL) | Unverified | 0
Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RL, Offline RL | Unverified | 0
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions | May 17, 2024 | Multi-agent Reinforcement Learning, Question Answering | Unverified | 0
Simulation-Based Benchmarking of Reinforcement Learning Agents for Personalized Retail Promotions | May 16, 2024 | Benchmarking, Reinforcement Learning (RL) | Code Available | 0
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning | May 16, 2024 | Decision Making, Instruction Following | Unverified | 0
Stochastic Q-learning for Large Discrete Action Spaces | May 16, 2024 | Decision Making, Q-Learning | Unverified | 0
IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues | May 15, 2024 | Information Retrieval, Question Answering | Unverified | 0
Deep Learning in Earthquake Engineering: A Comprehensive Review | May 15, 2024 | Deep Learning, Dimensionality Reduction | Unverified | 0
Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning | May 15, 2024 | Reinforcement Learning (RL) | Unverified | 0
CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving | May 15, 2024 | Autonomous Driving, Autonomous Vehicles | Code Available | 3
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement | May 14, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0
Reinformer: Max-Return Sequence Modeling for Offline RL | May 14, 2024 | D4RL, Offline RL | Code Available | 1
Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments | May 14, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0