MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research Sep 27, 2021 Deep Reinforcement Learning NetHack
— Unverified 0Minimal Batch Adaptive Learning Policy Engine for Real-Time Mid-Price Forecasting in High-Frequency Trading Dec 26, 2024 Feature Importance Reinforcement Learning (RL)
— Unverified 0Minimalist and High-performance Conversational Recommendation with Uncertainty Estimation for User Preference Jun 29, 2022 Attribute Conversational Recommendation
— Unverified 0Minimalistic Attacks: How Little it Takes to Fool a Deep Reinforcement Learning Policy Nov 10, 2019 Adversarial Attack Atari Games
— Unverified 0Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning Jan 24, 2023 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Minimax Model Learning Mar 2, 2021 model Model-based Reinforcement Learning
— Unverified 0Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning Mar 14, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs Oct 1, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Minimax Optimal Reinforcement Learning with Quasi-Optimism Mar 2, 2025 Computational Efficiency reinforcement-learning
— Unverified 0Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning Apr 14, 2023 Offline RL reinforcement-learning
— Unverified 0Minimax Sample Complexity for Turn-based Stochastic Game Nov 29, 2020 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Minimax Strikes Back Dec 19, 2020 Deep Reinforcement Learning GPU
— Unverified 0Minimax Weight and Q-Function Learning for Off-Policy Evaluation Oct 28, 2019 Off-policy evaluation Reinforcement Learning
— Unverified 0Minimax Weight Learning for Absorbing MDPs Jan 9, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Minimizing Communication while Maximizing Performance in Multi-Agent Reinforcement Learning Jun 15, 2021 Multi-agent Reinforcement Learning Multi-Task Learning
— Unverified 0Minimizing Human Assistance: Augmenting a Single Demonstration for Deep Reinforcement Learning Sep 22, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Minimizing Safety Interference for Safe and Comfortable Automated Driving with Distributional Reinforcement Learning Jul 15, 2021 Autonomous Vehicles Distributional Reinforcement Learning
— Unverified 0Minimizing the Outage Probability in a Markov Decision Process Feb 28, 2023 Q-Learning reinforcement-learning
— Unverified 0Minimum Description Length Control Jul 17, 2022 Bayesian Inference continuous-control
— Unverified 0Minimum Description Length Skills for Accelerated Reinforcement Learning Mar 9, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Minimum information divergence of Q-functions for dynamic treatment resumes Nov 16, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Mining Evidences for Concept Stock Recommendation Jun 1, 2018 Deep Reinforcement Learning Information Retrieval
— Unverified 0Mint: Matrix-Interleaving for Multi-Task Learning Sep 25, 2019 Multi-Task Learning reinforcement-learning
— Unverified 0APPTeK: Agent-Based Predicate Prediction in Temporal Knowledge Graphs Oct 27, 2021 Knowledge Graphs Prediction
— Unverified 0Mirror Descent Actor Critic via Bounded Advantage Learning Feb 6, 2025 Reinforcement Learning (RL)
— Unverified 0Mission schedule of agile satellites based on Proximal Policy Optimization Algorithm Jul 5, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Misspecification in Inverse Reinforcement Learning Dec 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Mis-spoke or mis-lead: Achieving Robustness in Multi-Agent Communicative Reinforcement Learning Aug 9, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Mitigate Bias in Face Recognition using Skewness-Aware Reinforcement Learning Nov 25, 2019 Face Recognition Fairness
— Unverified 0Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning Jun 1, 2020 Face Recognition Fairness
— Unverified 0Mitigating Dimensionality in 2D Rectangle Packing Problem under Reinforcement Learning Schema Sep 15, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Mitigating Multi-Stage Cascading Failure by Reinforcement Learning Aug 19, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Mitigating Partial Observability in Adaptive Traffic Signal Control with Transformers Sep 16, 2024 Management Reinforcement Learning (RL)
— Unverified 0Mitigating Planner Overfitting in Model-Based Reinforcement Learning Dec 3, 2018 model Model-based Reinforcement Learning
— Unverified 0Mitigating Political Bias in Language Models Through Reinforced Calibration Apr 30, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization Mar 23, 2025 Reinforcement Learning (RL) Response Generation
— Unverified 0Mitigation of Adversarial Policy Imitation via Constrained Randomization of Policy (CRoP) Sep 29, 2021 Deep Reinforcement Learning Imitation Learning
— Unverified 0Mitigation of Policy Manipulation Attacks on Deep Q-Networks with Parameter-Space Noise Jun 4, 2018 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Mix and Match: Markov Chains & Mixing Times for Matching in Rideshare Nov 30, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning Oct 29, 2021 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Robust Policy Optimization in Continuous-time Mixed H_2/H_ Stochastic Control Sep 9, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Mixed-Precision Conjugate Gradient Solvers with RL-Driven Precision Tuning Apr 19, 2025 Computational Efficiency Q-Learning
— Unverified 0Mixed-Precision Neural Networks: A Survey Aug 11, 2022 Quantization Reinforcement Learning (RL)
— Unverified 0Mixed Reinforcement Learning with Additive Stochastic Uncertainty Feb 28, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Mixing Human Demonstrations with Self-Exploration in Experience Replay for Deep Reinforcement Learning Jul 14, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0MIX-MAB: Reinforcement Learning-based Resource Allocation Algorithm for LoRaWAN Jun 7, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Mix & Match - Agent Curricula for Reinforcement Learning Jul 1, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Mix&Match - Agent Curricula for Reinforcement Learning Jun 5, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees Sep 15, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0MLComp: A Methodology for Machine Learning-based Performance Estimation and Adaptive Selection of Pareto-Optimal Compiler Optimization Sequences Dec 9, 2020 Compiler Optimization reinforcement-learning
— Unverified 0