AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy Jun 16, 2025 Math Reinforcement Learning (RL)
— Unverified 0Augmenting Automated Game Testing with Deep Reinforcement Learning Mar 29, 2021 Deep Reinforcement Learning FPS Games
— Unverified 0Augmented Replay Memory in Reinforcement Learning With Continuous Control Dec 29, 2019 continuous-control Continuous Control
— Unverified 0Decentralized Circle Formation Control for Fish-like Robots in the Real-world via Reinforcement Learning Mar 9, 2021 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Decentralized Cooperative Lane Changing at Freeway Weaving Areas Using Multi-Agent Deep Reinforcement Learning Oct 5, 2021 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control Jul 2, 2020 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks Jun 22, 2022 Bilevel Optimization Federated Learning
— Unverified 0Augmented Random Search for Quadcopter Control: An alternative to Reinforcement Learning Nov 28, 2019 continuous-control Continuous Control
— Unverified 0Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML May 23, 2023 Bayesian Optimization Hyperparameter Optimization
— Unverified 0AITuning: Machine Learning-based Tuning Tool for Run-Time Communication Libraries Sep 13, 2019 BIG-bench Machine Learning Deep Reinforcement Learning
— Unverified 0AISYN: AI-driven Reinforcement Learning-Based Logic Synthesis Framework Feb 8, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0AUGMENTED POLICY GRADIENT METHODS FOR EFFICIENT REINFORCEMENT LEARNING Sep 25, 2019 Policy Gradient Methods reinforcement-learning
— Unverified 0Adaptive Control of Differentially Private Linear Quadratic Systems Aug 26, 2021 Reinforcement Learning (RL)
— Unverified 0AAPO: Enhance the Reasoning Capabilities of LLMs with Advantage Momentum May 20, 2025 Mathematical Reasoning Reinforcement Learning (RL)
— Unverified 0Heterogeneous Knowledge for Augmented Modular Reinforcement Learning Jun 1, 2023 Decision Making reinforcement-learning
— Unverified 0Augmented Memory Networks for Streaming-Based Active One-Shot Learning Mar 20, 2019 Active Learning One-Shot Learning
— Unverified 0Adaptive Control of an Inverted Pendulum by a Reinforcement Learning-based LQR Method Sep 30, 2023 Benchmarking Reinforcement Learning (RL)
— Unverified 0Augmented Memory Networks for Streaming-Based Active One-Shot Learning Sep 4, 2019 Active Learning One-Shot Learning
— Unverified 0Augmented Lagrangian-Based Safe Reinforcement Learning Approach for Distribution System Volt/VAR Control Oct 19, 2024 Reinforcement Learning (RL) Safe Reinforcement Learning
— Unverified 0AirRL: A Reinforcement Learning Approach to Urban Air Quality Inference Mar 27, 2020 Air Quality Inference reinforcement-learning
— Unverified 0ACERAC: Efficient reinforcement learning in fine time discretization Apr 8, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0De-Biased Modelling of Search Click Behavior with Reinforcement Learning May 21, 2021 Learning-To-Rank reinforcement-learning
— Unverified 0Augmented Intelligence in Smart Intersections: Local Digital Twins-Assisted Hybrid Autonomous Driving Oct 16, 2024 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning Sep 29, 2021 Reinforcement Learning (RL) Stochastic Optimization
— Unverified 0AI Recommendation Systems for Lane-Changing Using Adherence-Aware Reinforcement Learning Apr 28, 2025 Autonomous Driving Recommendation Systems
— Unverified 0A Two-stage Framework and Reinforcement Learning-based Optimization Algorithms for Complex Scheduling Problems Mar 10, 2021 Combinatorial Optimization Earth Observation
— Unverified 0AirCapRL: Autonomous Aerial Human Motion Capture using Deep Reinforcement Learning Jul 13, 2020 Decision Making Deep Reinforcement Learning
— Unverified 0Adaptive control of a mechatronic system using constrained residual reinforcement learning Oct 6, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0A Tutorial Introduction to Reinforcement Learning Apr 3, 2023 Q-Learning reinforcement-learning
— Unverified 0ATTRITION: Attacking Static Hardware Trojan Detection Techniques Using Reinforcement Learning Aug 26, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning Aug 9, 2022 Attribute Face Generation
— Unverified 0Attractor Selection in Nonlinear Energy Harvesting Using Deep Reinforcement Learning Oct 3, 2020 Deep Reinforcement Learning reinforcement-learning
— Unverified 0AI Planning: A Primer and Survey (Preliminary Report) Dec 7, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0A centralized reinforcement learning method for multi-agent job scheduling in Grid Sep 11, 2016 reinforcement-learning Reinforcement Learning
— Unverified 0Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning Sep 17, 2019 continuous-control Continuous Control
— Unverified 0ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing Mar 25, 2020 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Attitude Control of Highly Maneuverable Aircraft Using an Improved Q-learning Oct 22, 2022 continuous-control Continuous Control
— Unverified 0AIGenC: An AI generalisation model via creativity May 19, 2022 model reinforcement-learning
— Unverified 0Teaching on a Budget in Multi-Agent Deep Reinforcement Learning Apr 19, 2019 Deep Reinforcement Learning Multi-agent Reinforcement Learning
— Unverified 0DearFSAC: An Approach to Optimizing Unreliable Federated Learning via Deep Reinforcement Learning Jan 30, 2022 Deep Reinforcement Learning Federated Learning
— Unverified 0Attention Routing: track-assignment detailed routing using attention-based reinforcement learning Apr 20, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Attention-Privileged Reinforcement Learning Nov 19, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0AIGB: Generative Auto-bidding via Conditional Diffusion Modeling May 25, 2024 Reinforcement Learning (RL)
— Unverified 0Attention Privileged Reinforcement Learning for Domain Transfer Sep 25, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Attention or memory? Neurointerpretable agents in space and time Jul 9, 2020 Atari Games Dimensionality Reduction
— Unverified 0AI-Driven Resource Allocation in Optical Wireless Communication Systems Apr 8, 2023 Management Reinforcement Learning (RL)
— Unverified 0Robust Model-free Reinforcement Learning with Multi-objective Bayesian Optimization Oct 29, 2019 Bayesian Optimization reinforcement-learning
— Unverified 0Death and Suicide in Universal Artificial Intelligence Jun 2, 2016 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Attention Graph for Multi-Robot Social Navigation with Deep Reinforcement Learning Jan 31, 2024 Deep Reinforcement Learning Graph Neural Network
— Unverified 0Attention-driven Robotic Manipulation Jan 1, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0