HistoGym: A Reinforcement Learning Environment for Histopathological Image Analysis Aug 16, 2024 Cancer Classification OpenAI Gym
Code Code Available 0Adaptive Planning with Generative Models under Uncertainty Aug 2, 2024 Autonomous Navigation Decision Making
— Unverified 0Enhancing Hardware Fault Tolerance in Machines with Reinforcement Learning Policy Gradient Algorithms Jul 21, 2024 Continual Learning OpenAI Gym
— Unverified 0A Comprehensive Guide to Combining R and Python code for Data Science, Machine Learning and Reinforcement Learning Jul 19, 2024 OpenAI Gym
— Unverified 0Traffic control using intelligent timing of traffic lights with reinforcement learning technique and real-time processing of surveillance camera images May 22, 2024 Management OpenAI Gym
— Unverified 0Decision Mamba Architectures May 13, 2024 D4RL Imitation Learning
Code Code Available 0SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems May 7, 2024 CPU GPU
Code Code Available 0Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline May 4, 2024 Computational Efficiency MuJoCo
— Unverified 0Airlift Challenge: A Competition for Optimizing Cargo Delivery Apr 26, 2024 OpenAI Gym
— Unverified 0Enhancing Privacy and Security of Autonomous UAV Navigation Apr 26, 2024 Autonomous Navigation Disaster Response
— Unverified 0HomeLabGym: A real-world testbed for home energy management systems Apr 22, 2024 energy management Management
— Unverified 0Noisy Spiking Actor Network for Exploration Mar 7, 2024 continuous-control Continuous Control
— Unverified 0QF-tuner: Breaking Tradition in Reinforcement Learning Feb 26, 2024 OpenAI Gym Q-Learning
— Unverified 0MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces Feb 20, 2024 Decision Making Offline RL
Code Code Available 0Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization Feb 19, 2024 counterfactual OpenAI Gym
— Unverified 0Scilab-RL: A software framework for efficient reinforcement learning and cognitive modeling research Jan 25, 2024 Data Visualization Hyperparameter Optimization
— Unverified 0MultiSlot ReRanker: A Generic Model-based Re-Ranking Framework in Recommendation Systems Jan 11, 2024 Diversity OpenAI Gym
— Unverified 0Decision Making in Non-Stationary Environments with Policy-Augmented Search Jan 6, 2024 Decision Making Decision Making Under Uncertainty
Code Code Available 0A Closed-Loop Multi-perspective Visual Servoing Approach with Reinforcement Learning Dec 25, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Investigating the Performance and Reliability, of the Q-Learning Algorithm in Various Unknown Environments Dec 19, 2023 OpenAI Gym Pathfinder
Code Code Available 0Efficient Parallel Reinforcement Learning Framework using the Reactor Model Dec 7, 2023 OpenAI Gym Q-Learning
Code Code Available 0Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations Nov 21, 2023 OpenAI Gym Reinforcement Learning (RL)
— Unverified 0Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning Nov 16, 2023 Deep Reinforcement Learning OpenAI Gym
Code Code Available 0Bridging Dimensions: Confident Reachability for High-Dimensional Controllers Nov 8, 2023 Knowledge Distillation OpenAI Gym
Code Code Available 0Repairing Learning-Enabled Controllers While Preserving What Works Nov 6, 2023 OpenAI Gym
Code Code Available 0SDGym: Low-Code Reinforcement Learning Environments using System Dynamics Models Oct 19, 2023 OpenAI Gym reinforcement-learning
Code Code Available 0Neural architecture impact on identifying temporally extended Reinforcement Learning tasks Oct 4, 2023 Deep Reinforcement Learning image-classification
— Unverified 0Optimizing with Low Budgets: a Comparison on the Black-box Optimization Benchmarking Suite and OpenAI Gym Sep 29, 2023 Bayesian Optimization Benchmarking
— Unverified 0Implicit Sensing in Traffic Optimization: Advanced Deep Reinforcement Learning Techniques Sep 25, 2023 Autonomous Vehicles Deep Reinforcement Learning
— Unverified 0gym-saturation: Gymnasium environments for saturation provers (System description) Sep 16, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Attention Loss Adjusted Prioritized Experience Replay Sep 13, 2023 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Distributionally Robust Statistical Verification with Imprecise Neural Networks Aug 28, 2023 Active Learning MuJoCo
— Unverified 0Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning Aug 28, 2023 D4RL Off-policy evaluation
— Unverified 0On Combining Expert Demonstrations in Imitation Learning via Optimal Transport Jul 20, 2023 Imitation Learning OpenAI Gym
— Unverified 0Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing Jul 11, 2023 Lifelong learning OpenAI Gym
— Unverified 0Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning Jul 5, 2023 OpenAI Gym reinforcement-learning
Code Code Available 0Learning Environment Models with Continuous Stochastic Dynamics Jun 29, 2023 Acrobot Benchmarking
— Unverified 0Correcting discount-factor mismatch in on-policy policy gradient methods Jun 23, 2023 OpenAI Gym Policy Gradient Methods
— Unverified 0Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation Jun 23, 2023 Few-Shot Image Classification Few-Shot Imitation Learning
Code Code Available 0Deep Reinforcement Learning for ESG financial portfolio management Jun 19, 2023 Decision Making Deep Reinforcement Learning
— Unverified 0Mimicking Better by Matching the Approximate Action Distribution Jun 16, 2023 Imitation Learning MuJoCo
Code Code Available 0Active Inference in Hebbian Learning Networks Jun 8, 2023 OpenAI Gym Q-Learning
— Unverified 0Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving Jun 5, 2023 Autonomous Driving Motion Planning
Code Code Available 0Optimizing Attention and Cognitive Control Costs Using Temporally-Layered Architectures May 30, 2023 continuous-control Continuous Control
Code Code Available 0Discovering Individual Rewards in Collective Behavior through Inverse Multi-Agent Reinforcement Learning May 17, 2023 Multi-agent Reinforcement Learning OpenAI Gym
— Unverified 0Rethinking Population-assisted Off-policy Reinforcement Learning May 4, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Gym-preCICE: Reinforcement Learning Environments for Active Flow Control May 3, 2023 OpenAI Gym reinforcement-learning
— Unverified 0Signal Novelty Detection as an Intrinsic Reward for Robotics Apr 14, 2023 Acrobot Anomaly Detection
Code Code Available 0Exact and Cost-Effective Automated Transformation of Neural Network Controllers to Decision Tree Controllers Apr 11, 2023 Decision Making OpenAI Gym
— Unverified 0Causal Repair of Learning-enabled Cyber-physical Systems Apr 6, 2023 counterfactual Diagnostic
— Unverified 0