Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning | Jun 27, 2023 | D4RL, Offline RL | Unverified | 0
Learning to Modulate pre-trained Models in RL | Jun 26, 2023 | Reinforcement Learning (RL) | Code Available | 1
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback | Jun 26, 2023 | Benchmarking, Code Generation | Code Available | 2
Augmenting Control over Exploration Space in Molecular Dynamics Simulators to Streamline De Novo Analysis through Generative Control Policies | Jun 26, 2023 | Drug Discovery, Inductive Bias | Unverified | 0
Estimating player completion rate in mobile puzzle games using reinforcement learning | Jun 26, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Multivariate Time Series Early Classification Across Channel and Time Dimensions | Jun 26, 2023 | Classification, Early Classification | Code Available | 0
Supervised Pretraining Can Learn In-Context Reinforcement Learning | Jun 26, 2023 | Decision Making, In-Context Learning | Unverified | 0
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer | Jun 26, 2023 | Offline RL, Reinforcement Learning (RL) | Unverified | 0
Decentralized Multi-Robot Formation Control Using Reinforcement Learning | Jun 26, 2023 | Q-Learning, reinforcement-learning | Unverified | 0
PolicyClusterGCN: Identifying Efficient Clusters for Training Graph Convolutional Networks | Jun 25, 2023 | graph partitioning, Node Classification | Unverified | 0
A Framework for dynamically meeting performance objectives on a service mesh | Jun 25, 2023 | Management, Reinforcement Learning (RL) | Unverified | 0
Is RLHF More Difficult than Standard RL? | Jun 25, 2023 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching | Jun 24, 2023 | Imitation Learning, Offline RL | Unverified | 0
Towards Optimal Pricing of Demand Response -- A Nonparametric Constrained Policy Optimization Approach | Jun 24, 2023 | Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning with Temporal-Logic-Based Causal Diagrams | Jun 23, 2023 | reinforcement-learning, Reinforcement Learning | Unverified | 0
Active Coverage for PAC Reinforcement Learning | Jun 23, 2023 | reinforcement-learning, Reinforcement Learning | Unverified | 0
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning | Jun 23, 2023 | Imitation Learning, Offline RL | Unverified | 0
MP3: Movement Primitive-Based (Re-)Planning Policy | Jun 22, 2023 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning | Jun 22, 2023 | continuous-control, Continuous Control | Code Available | 1
Transferable Curricula through Difficulty Conditioned Generators | Jun 22, 2023 | Reinforcement Learning (RL), Starcraft | Unverified | 0
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting | Jun 22, 2023 | Offline RL, reinforcement-learning | Code Available | 1
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning | Jun 22, 2023 | Data Augmentation, Offline RL | Code Available | 1
State-wise Constrained Policy Optimization | Jun 21, 2023 | Autonomous Driving, reinforcement-learning | Code Available | 1
AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization | Jun 21, 2023 | Management, Marketing | Code Available | 0
Learning to Generate Better Than Your LLM | Jun 20, 2023 | Conditional Text Generation, reinforcement-learning | Code Available | 1
Efficient Dynamics Modeling in Interactive Environments with Koopman Theory | Jun 20, 2023 | Reinforcement Learning (RL) | Unverified | 0
Int-HRL: Towards Intention-based Hierarchical Reinforcement Learning | Jun 20, 2023 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0
Reward Shaping via Diffusion Process in Reinforcement Learning | Jun 20, 2023 | Navigate, reinforcement-learning | Unverified | 0
Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation | Jun 20, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified | 0
Neural Inventory Control in Networks via Hindsight Differentiable Policy Optimization | Jun 20, 2023 | Deep Reinforcement Learning, Management | Code Available | 1
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap | Jun 20, 2023 | Offline RL, Reinforcement Learning (RL) | Unverified | 0
Adversarial Search and Tracking with Multiagent Reinforcement Learning in Sparsely Observable Environment | Jun 20, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Code Available | 1
Adaptive Ordered Information Extraction with Deep Reinforcement Learning | Jun 19, 2023 | Deep Reinforcement Learning, Event Extraction | Code Available | 0
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning | Jun 19, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Code Available | 1
AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents | Jun 19, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0
On the Model-Misspecification in Reinforcement Learning | Jun 19, 2023 | model, Open-Ended Question Answering | Unverified | 0
Enhancing variational quantum state diagonalization using reinforcement learning techniques | Jun 19, 2023 | Quantum Machine Learning, reinforcement-learning | Code Available | 0
Acceleration in Policy Optimization | Jun 18, 2023 | Meta-Learning, Policy Gradient Methods | Unverified | 0
The RL Perceptron: Generalisation Dynamics of Policy Learning in High Dimensions | Jun 17, 2023 | Atari Games, Reinforcement Learning (RL) | Unverified | 0
Active Policy Improvement from Multiple Black-box Oracles | Jun 17, 2023 | Imitation Learning, Reinforcement Learning (RL) | Code Available | 0
Genes in Intelligent Agents | Jun 17, 2023 | reinforcement-learning, Reinforcement Learning | Code Available | 0
Do as I can, not as I get | Jun 17, 2023 | Knowledge Graphs, Multi-modal Knowledge Graph | Unverified | 0
The False Dawn: Reevaluating Google's Reinforcement Learning for Chip Macro Placement | Jun 16, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0
Bootstrapped Representations in Reinforcement Learning | Jun 16, 2023 | Auxiliary Learning, reinforcement-learning | Unverified | 0
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX | Jun 16, 2023 | Decision Making, reinforcement-learning | Code Available | 2
Semi-Offline Reinforcement Learning for Optimized Text Generation | Jun 16, 2023 | Offline RL, reinforcement-learning | Code Available | 0
Temporal Difference Learning with Experience Replay | Jun 16, 2023 | Reinforcement Learning (RL) | Unverified | 0
Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling | Jun 15, 2023 | Reinforcement Learning (RL), Sensitivity | Unverified | 0
Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization | Jun 15, 2023 | Management, Multi-agent Reinforcement Learning | Unverified | 0
Granger Causal Interaction Skill Chains | Jun 15, 2023 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0