Towards Dynamic Trend Filtering through Trend Point Detection with Reinforcement Learning Jun 6, 2024 Reinforcement Learning (RL) Time Series
Code Code Available 0HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning Jun 6, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 1ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories Jun 6, 2024 Data Augmentation reinforcement-learning
— Unverified 0Bootstrapping Expectiles in Reinforcement Learning Jun 6, 2024 Q-Learning reinforcement-learning
— Unverified 0Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models Jun 6, 2024 Offline RL reinforcement-learning
— Unverified 0Prompt-based Visual Alignment for Zero-shot Policy Transfer Jun 5, 2024 Autonomous Driving Language Modelling
— Unverified 0Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms Jun 5, 2024 Reinforcement Learning (RL)
— Unverified 0"Give Me an Example Like This": Episodic Active Reinforcement Learning from Demonstrations Jun 5, 2024 Active Learning Reinforcement Learning (RL)
Code Code Available 0UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning Jun 5, 2024 D4RL Offline RL
— Unverified 0CommonPower: A Framework for Safe Data-Driven Smart Grid Control Jun 5, 2024 Benchmarking energy management
Code Code Available 1From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation Jun 5, 2024 Language Modeling Language Modelling
— Unverified 0DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays Jun 5, 2024 MuJoCo Reinforcement Learning (RL)
— Unverified 0Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning Jun 5, 2024 Quantization Reinforcement Learning (RL)
Code Code Available 1iQRL -- Implicitly Quantized Representations for Sample-efficient Reinforcement Learning Jun 4, 2024 continuous-control Continuous Control
— Unverified 0By Fair Means or Foul: Quantifying Collusion in a Market Simulation with Deep Reinforcement Learning Jun 4, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Mamba as Decision Maker: Exploring Multi-scale Sequence Modeling in Offline Reinforcement Learning Jun 4, 2024 Mamba OpenAI Gym
Code Code Available 1Smaller Batches, Bigger Gains? Investigating the Impact of Batch Sizes on Reinforcement Learning Based Real-World Production Scheduling Jun 4, 2024 Reinforcement Learning (RL) Scheduling
— Unverified 0Rectifying Reinforcement Learning for Reward Matching Jun 4, 2024 Decision Making reinforcement-learning
— Unverified 0Reinforcement Learning with Lookahead Information Jun 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Cross-Embodiment Robot Manipulation Skill Transfer using Latent Space Alignment Jun 4, 2024 Decoder Reinforcement Learning (RL)
Code Code Available 1A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning Jun 4, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning Jun 4, 2024 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading Jun 3, 2024 Algorithmic Trading Imitation Learning
— Unverified 0Reinforcement Learning as a Robotics-Inspired Framework for Insect Navigation: From Spatial Representations to Neural Implementation Jun 3, 2024 Reinforcement Learning (RL) Robot Navigation
— Unverified 0Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond Jun 3, 2024 Multi-Armed Bandits Reinforcement Learning (RL)
— Unverified 0Causal prompting model-based offline reinforcement learning Jun 3, 2024 model Offline RL
— Unverified 0REvolve: Reward Evolution with Large Language Models using Human Feedback Jun 3, 2024 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0A Fast Convergence Theory for Offline Decision Making Jun 3, 2024 Decision Making Offline RL
— Unverified 0When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL Jun 3, 2024 Reinforcement Learning (RL)
Code Code Available 0The Importance of Online Data: Understanding Preference Fine-tuning via Coverage Jun 3, 2024 Reinforcement Learning (RL)
— Unverified 0MOSEAC: Streamlined Variable Time Step Reinforcement Learning Jun 3, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0NeoRL: Efficient Exploration for Nonepisodic RL Jun 3, 2024 Efficient Exploration Reinforcement Learning (RL)
— Unverified 0Federated Learning-based Collaborative Wideband Spectrum Sensing and Scheduling for UAVs in UTM Systems Jun 3, 2024 Dataset Generation Federated Learning
— Unverified 0Learning the Target Network in Function Space Jun 3, 2024 Reinforcement Learning (RL)
— Unverified 0Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming Jun 2, 2024 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0A Digital Twin Framework for Reinforcement Learning with Real-Time Self-Improvement via Human Assistive Teleoperation Jun 2, 2024 Deep Reinforcement Learning reinforcement-learning
— Unverified 0FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning Jun 2, 2024 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 1Crafting a Pogo Stick in Minecraft with Heuristic Search (Extended Abstract) Jun 1, 2024 Heuristic Search Minecraft
Code Code Available 0SUBER: An RL Environment with Simulated Human Behavior for Recommender Systems Jun 1, 2024 Recommendation Systems Reinforcement Learning (RL)
Code Code Available 1Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling May 31, 2024 D4RL Mamba
— Unverified 0Bayesian Design Principles for Offline-to-Online Reinforcement Learning May 31, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation May 31, 2024 MuJoCo reinforcement-learning
Code Code Available 5Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning May 31, 2024 D4RL Reinforcement Learning (RL)
Code Code Available 1Reinforcement Learning for Sociohydrology May 31, 2024 Management reinforcement-learning
— Unverified 0In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought May 31, 2024 D4RL Decision Making
Code Code Available 1SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents May 30, 2024 Backdoor Attack reinforcement-learning
— Unverified 0Hybrid Reinforcement Learning Framework for Mixed-Variable Problems May 30, 2024 Bayesian Optimization reinforcement-learning
— Unverified 0From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems May 30, 2024 Decision Making Hierarchical Reinforcement Learning
— Unverified 0Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf May 30, 2024 Reinforcement Learning (RL)
— Unverified 0Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity May 30, 2024 Bilevel Optimization reinforcement-learning
— Unverified 0