A Reinforcement Learning Engine with Reduced Action and State Space for Scalable Cyber-Physical Optimal Response | Oct 6, 2024 | Reinforcement Learning (RL) | Unverified | 0
Improved Off-policy Reinforcement Learning in Biological Sequence Design | Oct 6, 2024 | reinforcement-learning, Reinforcement Learning | Code Available | 0
DeepLTL: Learning to Efficiently Satisfy Complex LTL Specifications for Multi-Task RL | Oct 6, 2024 | Reinforcement Learning (RL) | Unverified | 0
Improving Portfolio Optimization Results with Bandit Networks | Oct 5, 2024 | Portfolio Optimization, Recommendation Systems | Code Available | 0
Spatial-aware decision-making with ring attractors in reinforcement learning systems | Oct 4, 2024 | Decision Making, Reinforcement Learning (RL) | Unverified | 0
Predictive Coding for Decision Transformer | Oct 4, 2024 | Decision Making, Reinforcement Learning (RL) | Code Available | 1
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization | Oct 4, 2024 | Deep Reinforcement Learning, Quantization | Code Available | 1
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control | Oct 4, 2024 | Motion Generation, Reinforcement Learning (RL) | Code Available | 3
Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy Gradients | Oct 3, 2024 | Reinforcement Learning (RL) | Unverified | 0
Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping | Oct 3, 2024 | GPU, Mixture-of-Experts | Unverified | 0
ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI | Oct 3, 2024 | Few-Shot Imitation Learning, Imitation Learning | Code Available | 1
Learning Emergence of Interaction Patterns across Independent RL Agents in Multi-Agent Environments | Oct 3, 2024 | Multi-agent Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Cross-Embodiment Dexterous Grasping with Reinforcement Learning | Oct 3, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0
End-to-end Driving in High-Interaction Traffic Scenarios with Reinforcement Learning | Oct 3, 2024 | Autonomous Driving, CARLA Leaderboard 2.0 | Unverified | 0
Dual Active Learning for Reinforcement Learning from Human Feedback | Oct 3, 2024 | Active Learning, reinforcement-learning | Unverified | 0
Beyond Expected Returns: A Policy Gradient Algorithm for Cumulative Prospect Theoretic Reinforcement Learning | Oct 3, 2024 | Reinforcement Learning (RL) | Unverified | 0
The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability | Oct 2, 2024 | Model Predictive Control, Offline RL | Unverified | 0
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCo, Multi-agent Reinforcement Learning | Unverified | 0
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL | Oct 2, 2024 | Reinforcement Learning (RL) | Unverified | 0
LLM-Augmented Symbolic Reinforcement Learning with Landmark-Based Task Decomposition | Oct 2, 2024 | Common Sense Reasoning, Inductive logic programming | Unverified | 0
Adaptive teachers for amortized samplers | Oct 2, 2024 | Decision Making, Efficient Exploration | Code Available | 0
Sampling from Energy-based Policies using Diffusion | Oct 2, 2024 | continuous-control, Continuous Control | Unverified | 0
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment | Oct 2, 2024 | GSM8K, Math | Code Available | 2
Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models | Oct 2, 2024 | In-Context Learning, Reinforcement Learning (RL) | Unverified | 0
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space | Oct 2, 2024 | Decision Making, Distributional Reinforcement Learning | Unverified | 0
Scalable Reinforcement Learning-based Neural Architecture Search | Oct 2, 2024 | Neural Architecture Search, reinforcement-learning | Unverified | 0
PreND: Enhancing Intrinsic Motivation in Reinforcement Learning through Pre-trained Network Distillation | Oct 2, 2024 | Developmental Learning, reinforcement-learning | Unverified | 0
Absolute State-wise Constrained Policy Optimization: High-Probability State-wise Constraints Satisfaction | Oct 2, 2024 | Autonomous Driving, continuous-control | Unverified | 0
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining | Oct 1, 2024 | Atari Games, model | Code Available | 1
Upper and Lower Bounds for Distributionally Robust Off-Dynamics Reinforcement Learning | Sep 30, 2024 | 2k, Computational Efficiency | Unverified | 0
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner | Sep 30, 2024 | Reinforcement Learning (RL) | Unverified | 0
Personalisation via Dynamic Policy Fusion | Sep 30, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0
Focus On What Matters: Separated Models For Visual-Based RL Generalization | Sep 29, 2024 | Image Reconstruction, Reinforcement Learning (RL) | Unverified | 0
Analysis on Riemann Hypothesis with Cross Entropy Optimization and Reasoning | Sep 29, 2024 | Reinforcement Learning (RL) | Unverified | 0
Constrained Reinforcement Learning for Safe Heat Pump Control | Sep 29, 2024 | Benchmarking, reinforcement-learning | Code Available | 0
Grounded Curriculum Learning | Sep 29, 2024 | Reinforcement Learning (RL) | Unverified | 0
Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization | Sep 28, 2024 | Reinforcement Learning (RL) | Unverified | 0
Learning to Bridge the Gap: Efficient Novelty Recovery with Planning and Reinforcement Learning | Sep 28, 2024 | Reinforcement Learning (RL) | Unverified | 0
Strongly-polynomial time and validation analysis of policy gradient methods | Sep 28, 2024 | Policy Gradient Methods, Reinforcement Learning (RL) | Unverified | 0
Climate Adaptation with Reinforcement Learning: Experiments with Flooding and Transportation in Copenhagen | Sep 27, 2024 | Decision Making, Reinforcement Learning (RL) | Code Available | 0
ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning | Sep 27, 2024 | AutoML, Benchmarking | Code Available | 1
Enhancing Spectrum Efficiency in 6G Satellite Networks: A GAIL-Powered Policy Learning via Asynchronous Federated Inverse Reinforcement Learning | Sep 27, 2024 | Federated Learning, Imitation Learning | Unverified | 0
TemporalPaD: a reinforcement-learning framework for temporal feature representation and dimension reduction | Sep 27, 2024 | Dimensionality Reduction, Reinforcement Learning (RL) | Unverified | 0
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models | Sep 27, 2024 | Reinforcement Learning (RL), World Knowledge | Code Available | 1
Cost-Aware Dynamic Cloud Workflow Scheduling using Self-Attention and Evolutionary Reinforcement Learning | Sep 27, 2024 | Reinforcement Learning (RL), Scheduling | Unverified | 0
Optimizing Downlink C-NOMA Transmission with Movable Antennas: A DDPG-based Approach | Sep 26, 2024 | Reinforcement Learning (RL) | Unverified | 0
DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Sep 26, 2024 | continuous-control, Continuous Control | Code Available | 1
LoopSR: Looping Sim-and-Real for Lifelong Policy Adaptation of Legged Robots | Sep 26, 2024 | Contrastive Learning, Decoder | Unverified | 0
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards | Sep 26, 2024 | Automated Essay Scoring, reinforcement-learning | Unverified | 0
Asynchronous Fractional Multi-Agent Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing | Sep 25, 2024 | Deep Reinforcement Learning, Edge-computing | Unverified | 0