JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading Aug 25, 2023 GPU reinforcement-learning
— Unverified 0Continuous Reinforcement Learning-based Dynamic Difficulty Adjustment in a Visual Working Memory Game Aug 24, 2023 Memorization Reinforcement Learning (RL)
— Unverified 0Bayesian Exploration Networks Aug 24, 2023 Decision Making Decision Making Under Uncertainty
— Unverified 0Extreme Risk Mitigation in Reinforcement Learning using Extreme Value Theory Aug 24, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Conditional Kernel Imitation Learning for Continuous State Environments Aug 24, 2023 Density Estimation Imitation Learning
— Unverified 0MolOpt: Autonomous Molecular Geometry Optimization using Multi-Agent Reinforcement Learning Aug 24, 2023 3D geometry Computational chemistry
Code Code Available 0Reinforcement learning informed evolutionary search for autonomous systems testing Aug 24, 2023 Computational Efficiency Efficient Exploration
— Unverified 0Racing Towards Reinforcement Learning based control of an Autonomous Formula SAE Car Aug 24, 2023 Autonomous Navigation Deep Reinforcement Learning
— Unverified 0RamseyRL: A Framework for Intelligent Ramsey Number Counterexample Searching Aug 23, 2023 Reinforcement Learning (RL)
Code Code Available 0Aligning Language Models with Offline Learning from Human Feedback Aug 23, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems Aug 22, 2023 Interactive Recommendation Recommendation Systems
— Unverified 0Stabilizing Unsupervised Environment Design with a Learned Adversary Aug 21, 2023 Car Racing continuous-control
— Unverified 0A Homogenization Approach for Gradient-Dominated Stochastic Optimization Aug 21, 2023 Management Reinforcement Learning (RL)
— Unverified 0LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying Aug 21, 2023 Decision Making reinforcement-learning
Code Code Available 0Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL Aug 20, 2023 Atari Games continuous-control
— Unverified 0Accelerating Exact Combinatorial Optimization via RL-based Initialization -- A Case Study in Scheduling Aug 19, 2023 Combinatorial Optimization Reinforcement Learning (RL)
— Unverified 0UAV-assisted Semantic Communication with Hybrid Action Reinforcement Learning Aug 18, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games Aug 17, 2023 Multi-agent Reinforcement Learning Q-Learning
— Unverified 0ReProHRL: Towards Multi-Goal Navigation in the Real World using Hierarchical Agents Aug 17, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforced Self-Training (ReST) for Language Modeling Aug 17, 2023 Language Modeling Language Modelling
— Unverified 0Data-driven Integrated Sensing and Communication: Recent Advances, Challenges, and Future Prospects Aug 17, 2023 Integrated sensing and communication ISAC
— Unverified 0IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making Aug 17, 2023 Decision Making Imitation Learning
— Unverified 0Partially Observable Multi-Agent Reinforcement Learning with Information Sharing Aug 16, 2023 Computational Efficiency Multi-agent Reinforcement Learning
— Unverified 0On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing Aug 15, 2023 Cloud Computing CPU
— Unverified 0Planning to Learn: A Novel Algorithm for Active Learning during Model-Based Planning Aug 15, 2023 Active Learning counterfactual
Code Code Available 0Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World Aug 15, 2023 Offline RL reinforcement-learning
— Unverified 0A Reinforcement Learning Approach for Performance-aware Reduction in Power Consumption of Data Center Compute Nodes Aug 15, 2023 Management Reinforcement Learning (RL)
Code Code Available 0IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse Aug 14, 2023 Continual Learning Reinforcement Learning (RL)
— Unverified 0Insurance pricing on price comparison websites via reinforcement learning Aug 14, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0ACRE: Actor-Critic with Reward-Preserving Exploration Aug 14, 2023 continuous-control Continuous Control
Code Code Available 0Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads Aug 14, 2023 Reinforcement Learning (RL)
— Unverified 0Omega-Regular Reward Machines Aug 14, 2023 Reinforcement Learning (RL)
— Unverified 0Neural Categorical Priors for Physics-Based Character Control Aug 14, 2023 Diversity Reinforcement Learning (RL)
— Unverified 0InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models Aug 13, 2023 CPU GPU
— Unverified 0CyberForce: A Federated Reinforcement Learning Framework for Malware Mitigation Aug 11, 2023 Anomaly Detection Data Poisoning
— Unverified 0A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control Aug 10, 2023 Deep Reinforcement Learning Q-Learning
— Unverified 0Provably Efficient Algorithm for Nonstationary Low-Rank MDPs Aug 10, 2023 Reinforcement Learning (RL)
— Unverified 0Collaborative Wideband Spectrum Sensing and Scheduling for Networked UAVs in UTM Systems Aug 9, 2023 Management Multi-class Classification
— Unverified 0Actor-Critic with variable time discretization via sustained actions Aug 8, 2023 Reinforcement Learning (RL)
— Unverified 0Characterization of Human Balance through a Reinforcement Learning-based Muscle Controller Aug 8, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations Aug 7, 2023 Offline RL reinforcement-learning
— Unverified 0A Reinforcement Learning-Based Approach to Graph Discovery in D2D-Enabled Federated Learning Aug 7, 2023 Federated Learning Reinforcement Learning (RL)
— Unverified 0QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration Aug 7, 2023 continuous-control Continuous Control
— Unverified 0Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration Aug 4, 2023 Object reinforcement-learning
— Unverified 0PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback Aug 3, 2023 Bilevel Optimization Procedure Learning
— Unverified 0Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
— Unverified 0Follow the Soldiers with Optimized Single-Shot Multibox Detection and Reinforcement Learning Aug 2, 2023 object-detection Object Detection
— Unverified 0Revisiting a Design Choice in Gradient Temporal Difference Learning Aug 2, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization Aug 1, 2023 Bilevel Optimization Diversity
Code Code Available 0End-to-End Reinforcement Learning for Torque Based Variable Height Hopping Jul 31, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0