Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies | Feb 20, 2024 | Adversarial Attack MuJoCo | Code Available | 0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Feb 20, 2024 | Decision Making Offline RL | Code Available | 0
Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RL Decision Making | Unverified | 0
Offline Multi-task Transfer RL with Representational Penalization | Feb 19, 2024 | Offline RL Reinforcement Learning (RL) | Unverified | 0
A Critical Evaluation of AI Feedback for Aligning Large Language Models | Feb 19, 2024 | Instruction Following reinforcement-learning | Code Available | 2
Self-evolving Autoencoder Embedded Q-Network | Feb 18, 2024 | Decision Making Reinforcement Learning (RL) | Unverified | 0
Programmatic Reinforcement Learning: Navigating Gridworlds | Feb 18, 2024 | reinforcement-learning Reinforcement Learning | Unverified | 0
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks | Feb 17, 2024 | Deep Reinforcement Learning Multi-agent Reinforcement Learning | Unverified | 0
Modelling crypto markets by multi-agent reinforcement learning | Feb 16, 2024 | Multi-agent Reinforcement Learning reinforcement-learning | Code Available | 0
Policy Learning for Off-Dynamics RL with Deficient Support | Feb 16, 2024 | Reinforcement Learning (RL) | Code Available | 1
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Feb 15, 2024 | All Decision Making | Code Available | 2
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation | Feb 15, 2024 | Image Generation Reinforcement Learning (RL) | Unverified | 0
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Feb 15, 2024 | GPU Reinforcement Learning (RL) | Code Available | 1
Performative Reinforcement Learning in Gradually Shifting Environments | Feb 15, 2024 | reinforcement-learning Reinforcement Learning | Code Available | 0
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption | Feb 14, 2024 | Model-based Reinforcement Learning reinforcement-learning | Unverified | 0
How does Your RL Agent Explore? An Optimal Transport Analysis of Occupancy Measure Trajectories | Feb 14, 2024 | reinforcement-learning Reinforcement Learning | Unverified | 0
Steady-State Error Compensation for Reinforcement Learning with Quadratic Rewards | Feb 14, 2024 | reinforcement-learning Reinforcement Learning | Unverified | 0
Discovering Command and Control (C2) Channels on Tor and Public Networks Using Reinforcement Learning | Feb 14, 2024 | Reinforcement Learning (RL) | Unverified | 0
Exploiting Estimation Bias in Clipped Double Q-Learning for Continous Control Reinforcement Learning Tasks | Feb 14, 2024 | Computational Efficiency continuous-control | Unverified | 0
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models | Feb 13, 2024 | Denoising Reinforcement Learning (RL) | Unverified | 0
Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning | Feb 13, 2024 | Multi-agent Reinforcement Learning Q-Learning | Code Available | 0
Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea | Feb 13, 2024 | Autonomous Vehicles Reinforcement Learning (RL) | Unverified | 0
Hybrid Inverse Reinforcement Learning | Feb 13, 2024 | continuous-control Continuous Control | Code Available | 1
Intelligent Agricultural Management Considering N_2O Emission and Climate Variability with Uncertainties | Feb 13, 2024 | Decision Making Management | Unverified | 0
Optimal Task Assignment and Path Planning using Conflict-Based Search with Precedence and Temporal Constraints | Feb 13, 2024 | Multi-Agent Path Finding Reinforcement Learning (RL) | Unverified | 0
Auxiliary Reward Generation with Transition Distance Representation Learning | Feb 12, 2024 | Decision Making Reinforcement Learning (RL) | Unverified | 0
IR-Aware ECO Timing Optimization Using Reinforcement Learning | Feb 12, 2024 | reinforcement-learning Reinforcement Learning | Unverified | 0
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model | Feb 12, 2024 | Distributional Reinforcement Learning reinforcement-learning | Unverified | 0
Natural Language Reinforcement Learning | Feb 11, 2024 | Decision Making reinforcement-learning | Unverified | 0
Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments | Feb 11, 2024 | Future prediction Memorization | Unverified | 0
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF | Feb 10, 2024 | Bilevel Optimization reinforcement-learning | Unverified | 0
RLEEGNet: Integrating Brain-Computer Interfaces with Adaptive AI for Intuitive Responsiveness and High-Accuracy Motor Imagery Classification | Feb 9, 2024 | EEG Motor Imagery | Unverified | 0
Monitored Markov Decision Processes | Feb 9, 2024 | Reinforcement Learning (RL) | Code Available | 0
Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement | Feb 9, 2024 | Code Generation Decision Making | Code Available | 1
Learn to Teach: Sample-Efficient Privileged Learning for Humanoid Locomotion over Diverse Terrains | Feb 9, 2024 | Depth Estimation MuJoCo | Unverified | 0
Value function interference and greedy action selection in value-based multi-objective reinforcement learning | Feb 9, 2024 | Multi-Objective Reinforcement Learning Q-Learning | Unverified | 0
High-Precision Geosteering via Reinforcement Learning and Particle Filters | Feb 9, 2024 | Decision Making reinforcement-learning | Unverified | 0
Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks | Feb 9, 2024 | Graph Neural Network reinforcement-learning | Code Available | 1
ACTER: Diverse and Actionable Counterfactual Sequences for Explaining and Diagnosing RL Policies | Feb 9, 2024 | counterfactual Counterfactual Reasoning | Unverified | 0
Scaling Intelligent Agents in Combat Simulations for Wargaming | Feb 8, 2024 | Deep Reinforcement Learning Hierarchical Reinforcement Learning | Unverified | 0
Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning | Feb 8, 2024 | Deep Reinforcement Learning Offline RL | Unverified | 0
Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated Learning Offline RL | Unverified | 0
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization | Feb 8, 2024 | Q-Learning reinforcement-learning | Code Available | 0
Differentially Private Deep Model-Based Reinforcement Learning | Feb 8, 2024 | continuous-control Continuous Control | Unverified | 0
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning | Feb 8, 2024 | GSM8K reinforcement-learning | Code Available | 2
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL | Feb 8, 2024 | Computational Efficiency Reinforcement Learning (RL) | Code Available | 0
QGFN: Controllable Greediness with Action Values | Feb 7, 2024 | Diversity Reinforcement Learning (RL) | Code Available | 1
Convergence for Natural Policy Gradient on Infinite-State Queueing MDPs | Feb 7, 2024 | Reinforcement Learning (RL) | Unverified | 0
Safety Filters for Black-Box Dynamical Systems by Learning Discriminating Hyperplanes | Feb 7, 2024 | Reinforcement Learning (RL) | Code Available | 1
Context in Public Health for Underserved Communities: A Bayesian Approach to Online Restless Bandits | Feb 7, 2024 | Multi-Armed Bandits Reinforcement Learning (RL) | Unverified | 0