A Novel Reinforcement Learning Model for Post-Incident Malware Investigations | Oct 19, 2024 | Malware Detection Q-Learning | Unverified | 0
Action abstractions for amortized sampling | Oct 19, 2024 | Chunking Reinforcement Learning (RL) | Unverified | 0
Towards Effective Planning Strategies for Dynamic Opinion Networks | Oct 18, 2024 | Blocking Misinformation | Code Available | 0
Interpretable end-to-end Neurosymbolic Reinforcement Learning agents | Oct 18, 2024 | Atari Games Deep Reinforcement Learning | Unverified | 0
A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning | Oct 18, 2024 | Language Modeling Language Modelling | Unverified | 0
Reinforcement Learning in Non-Markov Market-Making | Oct 18, 2024 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
Harnessing Causality in Reinforcement Learning With Bagged Decision Times | Oct 18, 2024 | reinforcement-learning Reinforcement Learning | Unverified | 0
MarineFormer: A Spatio-Temporal Attention Model for USV Navigation in Dynamic Marine Environments | Oct 17, 2024 | Collision Avoidance Graph Attention | Unverified | 0
Guided Reinforcement Learning for Robust Multi-Contact Loco-Manipulation | Oct 17, 2024 | reinforcement-learning Reinforcement Learning | Unverified | 0
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization | Oct 17, 2024 | continuous-control Continuous Control | Code Available | 0
Coordinated Dispatch of Energy Storage Systems in the Active Distribution Network: A Complementary Reinforcement Learning and Optimization Approach | Oct 17, 2024 | Reinforcement Learning (RL) | Unverified | 0
Integrating Large Language Models and Reinforcement Learning for Non-Linear Reasoning | Oct 17, 2024 | Binary Classification Reinforcement Learning (RL) | Unverified | 0
Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach | Oct 17, 2024 | Policy Gradient Methods Reinforcement Learning (RL) | Unverified | 0
EdgeRL: Reinforcement Learning-driven Deep Learning Model Inference Optimization at Edge | Oct 16, 2024 | Deep Learning Inference Optimization | Unverified | 0
Augmented Intelligence in Smart Intersections: Local Digital Twins-Assisted Hybrid Autonomous Driving | Oct 16, 2024 | Autonomous Driving Reinforcement Learning (RL) | Unverified | 0
Off-dynamics Conditional Diffusion Planners | Oct 16, 2024 | Offline RL Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control | Oct 16, 2024 | continuous-control Continuous Control | Code Available | 0
Robust RL with LLM-Driven Data Synthesis and Policy Adaptation for Autonomous Driving | Oct 16, 2024 | Autonomous Driving Common Sense Reasoning | Unverified | 0
Sample-Efficient Reinforcement Learning with Temporal Logic Objectives: Leveraging the Task Specification to Guide Exploration | Oct 16, 2024 | Reinforcement Learning (RL) | Code Available | 0
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach | Oct 16, 2024 | Deep Reinforcement Learning Meta-Learning | Unverified | 0
Neural-based Control for CubeSat Docking Maneuvers | Oct 16, 2024 | Reinforcement Learning (RL) | Unverified | 0
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling | Oct 16, 2024 | Decision Making Reinforcement Learning (RL) | Unverified | 0
When to Trust Your Data: Enhancing Dyna-Style Model-Based Reinforcement Learning With Data Filter | Oct 16, 2024 | Model-based Reinforcement Learning Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning with LTL and ω-Regular Objectives via Optimality-Preserving Translation to Average Rewards | Oct 16, 2024 | Reinforcement Learning (RL) | Unverified | 0
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RL Model-based Reinforcement Learning | Code Available | 0
Multi-objective Reinforcement Learning: A Tool for Pluralistic Alignment | Oct 15, 2024 | Multi-Objective Reinforcement Learning reinforcement-learning | Unverified | 0
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task | Oct 15, 2024 | ARC Decision Making | Unverified | 0
Reinforcement Learning Based Bidding Framework with High-dimensional Bids in Power Markets | Oct 15, 2024 | Reinforcement Learning (RL) | Unverified | 0
Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning | Oct 15, 2024 | Collision Avoidance Offline RL | Unverified | 0
ILAEDA: An Imitation Learning Based Approach for Automatic Exploratory Data Analysis | Oct 15, 2024 | Imitation Learning Reinforcement Learning (RL) | Unverified | 0
Large Language Model-Enhanced Reinforcement Learning for Generic Bus Holding Control Strategies | Oct 14, 2024 | In-Context Learning Language Modeling | Unverified | 0
Asymptotic Analysis of Sample-averaged Q-learning | Oct 14, 2024 | OpenAI Gym Q-Learning | Unverified | 0
DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation | Oct 14, 2024 | Deep Reinforcement Learning Model Predictive Control | Unverified | 0
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning | Oct 14, 2024 | Distributional Reinforcement Learning reinforcement-learning | Unverified | 0
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes | Oct 14, 2024 | Decision Making Decision Making Under Uncertainty | Unverified | 0
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator | Oct 13, 2024 | All Bilevel Optimization | Unverified | 0
Generalization of Compositional Tasks with Logical Specification via Implicit Planning | Oct 13, 2024 | Graph Neural Network Reinforcement Learning (RL) | Unverified | 0
Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale | Oct 13, 2024 | Deep Reinforcement Learning reinforcement-learning | Code Available | 0
Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm | Oct 13, 2024 | Management Offline RL | Unverified | 0
Transformers as Game Players: Provable In-context Game-playing Capabilities of Pre-trained Models | Oct 13, 2024 | In-Context Learning Reinforcement Learning (RL) | Unverified | 0
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search | Oct 12, 2024 | Conversational Recommendation Conversational Search | Code Available | 0
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning | Oct 12, 2024 | Efficient Exploration reinforcement-learning | Unverified | 0
Reinforcement Learning in Hyperbolic Spaces: Models and Experiments | Oct 12, 2024 | reinforcement-learning Reinforcement Learning | Unverified | 0
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control | Oct 11, 2024 | continuous-control Continuous Control | Code Available | 0
Reinforcement Learning for Control of Non-Markovian Cellular Population Dynamics | Oct 11, 2024 | reinforcement-learning Reinforcement Learning | Code Available | 0
Words as Beacons: Guiding RL Agents with High-Level Language Prompts | Oct 11, 2024 | Reinforcement Learning (RL) | Unverified | 0
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL | Oct 11, 2024 | Deep Reinforcement Learning Reinforcement Learning (RL) | Unverified | 0
SOLD: Slot Object-Centric Latent Dynamics Models for Relational Manipulation Learning from Pixels | Oct 11, 2024 | Model-based Reinforcement Learning reinforcement-learning | Unverified | 0
Physical Simulation for Multi-agent Multi-machine Tending | Oct 11, 2024 | Reinforcement Learning (RL) | Unverified | 0
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment | Oct 11, 2024 | Benchmarking Reinforcement Learning (RL) | Unverified | 0