Language Reward Modulation for Pretraining Reinforcement Learning Aug 23, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 1RamseyRL: A Framework for Intelligent Ramsey Number Counterexample Searching Aug 23, 2023 Reinforcement Learning (RL)
Code Code Available 0Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems Aug 22, 2023 Interactive Recommendation Recommendation Systems
— Unverified 0LaGR-SEQ: Language-Guided Reinforcement Learning with Sample-Efficient Querying Aug 21, 2023 Decision Making reinforcement-learning
Code Code Available 0A Homogenization Approach for Gradient-Dominated Stochastic Optimization Aug 21, 2023 Management Reinforcement Learning (RL)
— Unverified 0Stabilizing Unsupervised Environment Design with a Learned Adversary Aug 21, 2023 Car Racing continuous-control
— Unverified 0Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL Aug 20, 2023 Atari Games continuous-control
— Unverified 0Accelerating Exact Combinatorial Optimization via RL-based Initialization -- A Case Study in Scheduling Aug 19, 2023 Combinatorial Optimization Reinforcement Learning (RL)
— Unverified 0UAV-assisted Semantic Communication with Hybrid Action Reinforcement Learning Aug 18, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforced Self-Training (ReST) for Language Modeling Aug 17, 2023 Language Modeling Language Modelling
— Unverified 0Data-driven Integrated Sensing and Communication: Recent Advances, Challenges, and Future Prospects Aug 17, 2023 Integrated sensing and communication ISAC
— Unverified 0ReProHRL: Towards Multi-Goal Navigation in the Real World using Hierarchical Agents Aug 17, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games Aug 17, 2023 Multi-agent Reinforcement Learning Q-Learning
— Unverified 0IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making Aug 17, 2023 Decision Making Imitation Learning
— Unverified 0Partially Observable Multi-Agent Reinforcement Learning with Information Sharing Aug 16, 2023 Computational Efficiency Multi-agent Reinforcement Learning
— Unverified 0Planning to Learn: A Novel Algorithm for Active Learning during Model-Based Planning Aug 15, 2023 Active Learning counterfactual
Code Code Available 0A Reinforcement Learning Approach for Performance-aware Reduction in Power Consumption of Data Center Compute Nodes Aug 15, 2023 Management Reinforcement Learning (RL)
Code Code Available 0Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World Aug 15, 2023 Offline RL reinforcement-learning
— Unverified 0On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing Aug 15, 2023 Cloud Computing CPU
— Unverified 0ACRE: Actor-Critic with Reward-Preserving Exploration Aug 14, 2023 continuous-control Continuous Control
Code Code Available 0IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse Aug 14, 2023 Continual Learning Reinforcement Learning (RL)
— Unverified 0Dialogue for Prompting: a Policy-Gradient-Based Discrete Prompt Generation for Few-shot Learning Aug 14, 2023 Few-Shot Learning Reinforcement Learning (RL)
Code Code Available 1Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads Aug 14, 2023 Reinforcement Learning (RL)
— Unverified 0Omega-Regular Reward Machines Aug 14, 2023 Reinforcement Learning (RL)
— Unverified 0Insurance pricing on price comparison websites via reinforcement learning Aug 14, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Neural Categorical Priors for Physics-Based Character Control Aug 14, 2023 Diversity Reinforcement Learning (RL)
— Unverified 0InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models Aug 13, 2023 CPU GPU
— Unverified 0CyberForce: A Federated Reinforcement Learning Framework for Malware Mitigation Aug 11, 2023 Anomaly Detection Data Poisoning
— Unverified 0A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control Aug 10, 2023 Deep Reinforcement Learning Q-Learning
— Unverified 0Provably Efficient Algorithm for Nonstationary Low-Rank MDPs Aug 10, 2023 Reinforcement Learning (RL)
— Unverified 0Collaborative Wideband Spectrum Sensing and Scheduling for Networked UAVs in UTM Systems Aug 9, 2023 Management Multi-class Classification
— Unverified 0Actor-Critic with variable time discretization via sustained actions Aug 8, 2023 Reinforcement Learning (RL)
— Unverified 0Characterization of Human Balance through a Reinforcement Learning-based Muscle Controller Aug 8, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations Aug 7, 2023 Offline RL reinforcement-learning
— Unverified 0A Reinforcement Learning-Based Approach to Graph Discovery in D2D-Enabled Federated Learning Aug 7, 2023 Federated Learning Reinforcement Learning (RL)
— Unverified 0QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration Aug 7, 2023 continuous-control Continuous Control
— Unverified 0Reinforcement Learning for Financial Index Tracking Aug 5, 2023 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation Aug 4, 2023 Abstractive Text Summarization Language Modeling
Code Code Available 1Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration Aug 4, 2023 Object reinforcement-learning
— Unverified 0PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback Aug 3, 2023 Bilevel Optimization Procedure Learning
— Unverified 0Bag of Policies for Distributional Deep Exploration Aug 3, 2023 Atari Games Efficient Exploration
— Unverified 0Revisiting a Design Choice in Gradient Temporal Difference Learning Aug 2, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Follow the Soldiers with Optimized Single-Shot Multibox Detection and Reinforcement Learning Aug 2, 2023 object-detection Object Detection
— Unverified 0qgym: A Gym for Training and Benchmarking RL-Based Quantum Compilation Aug 1, 2023 Benchmarking OpenAI Gym
Code Code Available 1Reinforcement Learning-based Non-Autoregressive Solver for Traveling Salesman Problems Aug 1, 2023 Combinatorial Optimization reinforcement-learning
Code Code Available 1BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization Aug 1, 2023 Bilevel Optimization Diversity
Code Code Available 0Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges Jul 31, 2023 Reinforcement Learning (RL) Survey
— Unverified 0End-to-End Reinforcement Learning for Torque Based Variable Height Hopping Jul 31, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0DRL4Route: A Deep Reinforcement Learning Framework for Pick-up and Delivery Route Prediction Jul 30, 2023 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1PIMbot: Policy and Incentive Manipulation for Multi-Robot Reinforcement Learning in Social Dilemmas Jul 29, 2023 Reinforcement Learning (RL)
Code Code Available 0