Maneuver Decision-Making For Autonomous Air Combat Through Curriculum Learning And Reinforcement Learning With Sparse Rewards Feb 12, 2023 Decision Making reinforcement-learning
— Unverified 0Procedural generation of meta-reinforcement learning tasks Feb 11, 2023 Meta-Learning Meta Reinforcement Learning
Code Code Available 1Cross-domain Random Pre-training with Prototypes for Reinforcement Learning Feb 11, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning Feb 11, 2023 Decision Making reinforcement-learning
— Unverified 0A SWAT-based Reinforcement Learning Framework for Crop Management Feb 10, 2023 Benchmarking Decision Making
Code Code Available 1The Wisdom of Hindsight Makes Language Models Better Instruction Followers Feb 10, 2023 Decision Making Language Modeling
Code Code Available 1A Survey on Causal Reinforcement Learning Feb 10, 2023 Decision Making reinforcement-learning
— Unverified 0On Penalty-based Bilevel Gradient Descent Method Feb 10, 2023 Bilevel Optimization Image Reconstruction
Code Code Available 1Combining Reconstruction and Contrastive Methods for Multimodal Representations in RL Feb 10, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Towards Minimax Optimality of Model-based Robust Reinforcement Learning Feb 10, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Low Entropy Communication in Multi-Agent Reinforcement Learning Feb 10, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0RayNet: A Simulation Platform for Developing Reinforcement Learning-Driven Network Protocols Feb 9, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 1Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning Feb 9, 2023 Quantization reinforcement-learning
— Unverified 0Scaling Goal-based Exploration via Pruning Proto-goals Feb 9, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills Feb 9, 2023 GPU Imitation Learning
Code Code Available 1Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments Feb 9, 2023 Autonomous Driving Autonomous Navigation
Code Code Available 1CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning Feb 9, 2023 continuous-control Continuous Control
— Unverified 0An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning Feb 9, 2023 Object Optical Character Recognition (OCR)
— Unverified 0Equivariant MuZero Feb 9, 2023 Deep Reinforcement Learning Model-based Reinforcement Learning
— Unverified 0Learning Complex Teamwork Tasks Using a Given Sub-task Decomposition Feb 9, 2023 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 0AISYN: AI-driven Reinforcement Learning-Based Logic Synthesis Framework Feb 8, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0A Scale-Independent Multi-Objective Reinforcement Learning with Convergence Analysis Feb 8, 2023 Decision Making Multi-Objective Reinforcement Learning
— Unverified 0Near-Optimal Adversarial Reinforcement Learning with Switching Costs Feb 8, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Efficient Planning in Combinatorial Action Spaces with Applications to Cooperative Multi-Agent Reinforcement Learning Feb 8, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints Feb 8, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation Feb 8, 2023 Hierarchical Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 0Predictable MDP Abstraction for Unsupervised Model-Based RL Feb 8, 2023 model Model-based Reinforcement Learning
Code Code Available 1Non-zero-sum Game Control for Multi-vehicle Driving via Reinforcement Learning Feb 8, 2023 Model-based Reinforcement Learning reinforcement-learning
Code Code Available 0Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative Feedback Feb 7, 2023 Decision Making Reinforcement Learning (RL)
— Unverified 0Online Reinforcement Learning with Uncertain Episode Lengths Feb 7, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Optimizing Audio Recommendations for the Long-Term: A Reinforcement Learning Perspective Feb 7, 2023 Recommendation Systems reinforcement-learning
— Unverified 0Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence Feb 7, 2023 Continuous Control MuJoCo
Code Code Available 1Adaptive Aggregation for Safety-Critical Control Feb 7, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning Feb 7, 2023 Efficient Exploration Multi-agent Reinforcement Learning
— Unverified 0Multi-Task Recommendations with Reinforcement Learning Feb 7, 2023 Multi-Task Learning Recommendation Systems
Code Code Available 1Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR Feb 7, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Transfer learning for process design with reinforcement learning Feb 7, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning Feb 7, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Intrinsic Rewards from Self-Organizing Feature Maps for Exploration in Reinforcement Learning Feb 6, 2023 Clustering Deep Reinforcement Learning
Code Code Available 0Holistic Deep-Reinforcement-Learning-based Training of Autonomous Navigation Systems Feb 6, 2023 Autonomous Navigation Deep Reinforcement Learning
— Unverified 0RLTP: Reinforcement Learning to Pace for Delayed Impression Modeling in Preloaded Ads Feb 6, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0State-wise Safe Reinforcement Learning: A Survey Feb 6, 2023 Autonomous Driving reinforcement-learning
— Unverified 0A Strong Baseline for Batch Imitation Learning Feb 6, 2023 continuous-control Continuous Control
— Unverified 0Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning Feb 6, 2023 Decision Making reinforcement-learning
Code Code Available 2DITTO: Offline Imitation Learning with World Models Feb 6, 2023 Imitation Learning reinforcement-learning
— Unverified 0Offline Learning in Markov Games with General Function Approximation Feb 6, 2023 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Efficient Online Reinforcement Learning with Offline Data Feb 6, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 2Arena-Web -- A Web-based Development and Benchmarking Platform for Autonomous Navigation Approaches Feb 6, 2023 Autonomous Navigation Benchmarking
— Unverified 0Offline Learning of Closed-Loop Deep Brain Stimulation Controllers for Parkinson Disease Treatment Feb 5, 2023 Reinforcement Learning (RL)
Code Code Available 0Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage Feb 5, 2023 Offline RL Q-Learning
— Unverified 0