General Method for Solving Four Types of SAT Problems Dec 27, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration Dec 26, 2023 Deep Reinforcement Learning Edge-computing
— Unverified 0LLMLight: Large Language Models as Traffic Signal Control Agents Dec 26, 2023 Decision Making Management
Code Code Available 2Learning Online Policies for Person Tracking in Multi-View Environments Dec 26, 2023 Human Detection Reinforcement Learning (RL)
— Unverified 0Efficient Reinforcement Learning via Decoupling Exploration and Utilization Dec 26, 2023 Autonomous Vehicles MuJoCo
Code Code Available 1PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning Dec 26, 2023 Decision Making Deep Reinforcement Learning
Code Code Available 1Agent based modelling for continuously varying supply chains Dec 24, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for Safe Occupancy Strategies in Educational Spaces during an Epidemic Dec 23, 2023 Management Q-Learning
— Unverified 0Mutual Information as Intrinsic Reward of Reinforcement Learning Agents for On-demand Ride Pooling Dec 23, 2023 Reinforcement Learning (RL)
— Unverified 0Gradient Shaping for Multi-Constraint Safe Reinforcement Learning Dec 23, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Human-AI Collaboration in Real-World Complex Environment with Reinforcement Learning Dec 23, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization Dec 23, 2023 Quantization Reinforcement Learning (RL)
— Unverified 0REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback Dec 22, 2023 Bilevel Optimization continuous-control
— Unverified 0Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning Dec 22, 2023 AI Agent Reinforcement Learning (RL)
— Unverified 0A Survey of Reinforcement Learning from Human Feedback Dec 22, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Multiagent Copilot Approach for Shared Autonomy between Human EEG and TD3 Deep Reinforcement Learning Dec 22, 2023 Brain Computer Interface Deep Reinforcement Learning
— Unverified 0Critic-Guided Decision Transformer for Offline Reinforcement Learning Dec 21, 2023 D4RL Offline RL
Code Code Available 1Multi-Agent Probabilistic Ensembles with Trajectory Sampling for Connected Autonomous Vehicles Dec 21, 2023 Autonomous Vehicles Decision Making
— Unverified 0Maximum entropy GFlowNets with soft Q-learning Dec 21, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0Optimizing Heat Alert Issuance with Reinforcement Learning Dec 21, 2023 Data Augmentation Decision Making
Code Code Available 0Diffusion Reward: Learning Rewards via Conditional Video Diffusion Dec 21, 2023 Diversity Reinforcement Learning (RL)
Code Code Available 1RFRL Gym: A Reinforcement Learning Testbed for Cognitive Radio Applications Dec 20, 2023 OpenAI Gym reinforcement-learning
Code Code Available 1OpenRL: A Unified Reinforcement Learning Framework Dec 20, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 2Optimal coordination of resources: A solution from reinforcement learning Dec 20, 2023 Q-Learning reinforcement-learning
— Unverified 0Towards Machines that Trust: AI Agents Learn to Trust in the Trust Game Dec 20, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Parameterized Projected Bellman Operator Dec 20, 2023 Decision Making Reinforcement Learning (RL)
Code Code Available 0BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning Dec 19, 2023 Backdoor Attack reinforcement-learning
Code Code Available 0Data-Driven Merton's Strategies via Policy Randomization Dec 19, 2023 Reinforcement Learning (RL)
— Unverified 0Stable Relay Learning Optimization Approach for Fast Power System Production Cost Minimization Simulation Dec 19, 2023 Imitation Learning Reinforcement Learning (RL)
— Unverified 0CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning Dec 19, 2023 Navigate Offline RL
— Unverified 0A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments Dec 19, 2023 Reinforcement Learning (RL) Zero-shot Generalization
— Unverified 0Neural Network Approximation for Pessimistic Offline Reinforcement Learning Dec 19, 2023 Deep Reinforcement Learning Offline RL
— Unverified 0Solving the swing-up and balance task for the Acrobot and Pendubot with SAC Dec 18, 2023 Acrobot Position
— Unverified 0Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis Dec 18, 2023 Bayesian Inference Reinforcement Learning (RL)
— Unverified 0Active search and coverage using point-cloud reinforcement learning Dec 18, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Challenges for Reinforcement Learning in Quantum Circuit Design Dec 18, 2023 Quantum Machine Learning reinforcement-learning
Code Code Available 1Learning to Act without Actions Dec 17, 2023 Reinforcement Learning (RL)
Code Code Available 1CACTO-SL: Using Sobolev Learning to improve Continuous Actor-Critic with Trajectory Optimization Dec 17, 2023 Reinforcement Learning (RL)
Code Code Available 1Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning Dec 16, 2023 Autonomous Driving Autonomous Racing
Code Code Available 0Active Reinforcement Learning for Robust Building Control Dec 16, 2023 Atari Games Game of Go
Code Code Available 1Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning Dec 16, 2023 Reinforcement Learning (RL) Safe Reinforcement Learning
Code Code Available 0Advancing RAN Slicing with Offline Reinforcement Learning Dec 16, 2023 Management Offline RL
— Unverified 0Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints Dec 16, 2023 Decision Making Fairness
— Unverified 0Fractional Deep Reinforcement Learning for Age-Minimal Mobile Edge Computing Dec 16, 2023 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Assume-Guarantee Reinforcement Learning Dec 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping Dec 15, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Towards Automatic Data Augmentation for Disordered Speech Recognition Dec 14, 2023 Data Augmentation Reinforcement Learning (RL)
— Unverified 0iOn-Profiler: intelligent Online multi-objective VNF Profiling with Reinforcement Learning Dec 14, 2023 CPU reinforcement-learning
— Unverified 0ReCoRe: Regularized Contrastive Representation Learning of World Model Dec 14, 2023 Contrastive Learning Denoising
— Unverified 0World Models via Policy-Guided Trajectory Diffusion Dec 13, 2023 continuous-control Continuous Control
Code Code Available 1