A Multimodal Learning-based Approach for Autonomous Landing of UAV May 21, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Rethinking Robustness Assessment: Adversarial Attacks on Learning-based Quadrupedal Locomotion Controllers May 21, 2024 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Practical and efficient quantum circuit synthesis and transpiling with Reinforcement Learning May 21, 2024 Reinforcement Learning (RL)
— Unverified 0Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing May 21, 2024 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Highway Graph to Accelerate Reinforcement Learning May 20, 2024 reinforcement-learning Reinforcement Learning
Code Code Available 0Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning May 20, 2024 continuous-control Continuous Control
— Unverified 0Investigating the Impact of Choice on Deep Reinforcement Learning for Space Controls May 20, 2024 continuous-control Continuous Control
— Unverified 0Scrutinize What We Ignore: Reining In Task Representation Shift Of Context-Based Offline Meta Reinforcement Learning May 20, 2024 Meta-Learning Meta Reinforcement Learning
Code Code Available 0Large Language Models are Biased Reinforcement Learners May 19, 2024 Decision Making In-Context Learning
Code Code Available 0Do No Harm: A Counterfactual Approach to Safe Reinforcement Learning May 19, 2024 counterfactual Friction
— Unverified 0Comparisons Are All You Need for Optimizing Smooth Functions May 19, 2024 All reinforcement-learning
— Unverified 0Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses May 18, 2024 D4RL Offline RL
— Unverified 0Optimal control barrier functions for RL based safe powertrain control May 18, 2024 Reinforcement Learning (RL)
— Unverified 0Combined film and pulse heating of lithium ion batteries to improve performance in low ambient temperature May 18, 2024 Reinforcement Learning (RL)
— Unverified 0LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions May 17, 2024 Multi-agent Reinforcement Learning Question Answering
— Unverified 0Stochastic Q-learning for Large Discrete Action Spaces May 16, 2024 Decision Making Q-Learning
— Unverified 0Simulation-Based Benchmarking of Reinforcement Learning Agents for Personalized Retail Promotions May 16, 2024 Benchmarking Reinforcement Learning (RL)
Code Code Available 0Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning May 16, 2024 Decision Making Instruction Following
— Unverified 0Deep Learning in Earthquake Engineering: A Comprehensive Review May 15, 2024 Deep Learning Dimensionality Reduction
— Unverified 0Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning May 15, 2024 Reinforcement Learning (RL)
— Unverified 0IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues May 15, 2024 Information Retrieval Question Answering
— Unverified 0vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement May 14, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments May 14, 2024 Decision Making Deep Reinforcement Learning
— Unverified 0Neural Network Compression for Reinforcement Learning Tasks May 13, 2024 Neural Network Compression reinforcement-learning
— Unverified 0Hype or Heuristic? Quantum Reinforcement Learning for Join Order Optimisation May 13, 2024 Low-latency processing reinforcement-learning
Code Code Available 0Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback May 13, 2024 Reinforcement Learning (RL)
— Unverified 0Reducing Risk for Assistive Reinforcement Learning Policies with Diffusion Models May 13, 2024 Imitation Learning reinforcement-learning
— Unverified 0Intrinsic Rewards for Exploration without Harm from Observational Noise: A Simulation Study Based on the Free Energy Principle May 13, 2024 Efficient Exploration Navigate
— Unverified 0CAGES: Cost-Aware Gradient Entropy Search for Efficient Local Multi-Fidelity Bayesian Optimization May 13, 2024 Bayesian Optimization Reinforcement Learning (RL)
Code Code Available 0Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning May 12, 2024 Offline RL Reinforcement Learning (RL)
— Unverified 0Fairness in Reinforcement Learning: A Survey May 11, 2024 Autonomous Vehicles Fairness
— Unverified 0Space Processor Computation Time Analysis for Reinforcement Learning and Run Time Assurance Control Policies May 10, 2024 Reinforcement Learning (RL)
— Unverified 0Dominion: A New Frontier for AI Research May 10, 2024 Reinforcement Learning (RL)
— Unverified 0Improving Targeted Molecule Generation through Language Model Fine-Tuning Via Reinforcement Learning May 10, 2024 Drug Design Language Modeling
— Unverified 0An Overview of Machine Learning-Enabled Optimization for Reconfigurable Intelligent Surfaces-Aided 6G Networks: From Reinforcement Learning to Large Language Models May 9, 2024 Hierarchical Reinforcement Learning Management
— Unverified 0Fast Stochastic Policy Gradient: Negative Momentum for Reinforcement Learning May 8, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Genetic Drift Regularization: on preventing Actor Injection from breaking Evolution Strategies May 7, 2024 Evolutionary Algorithms Reinforcement Learning (RL)
— Unverified 0SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems May 7, 2024 CPU GPU
Code Code Available 0Roadside Units Assisted Localized Automated Vehicle Maneuvering: An Offline Reinforcement Learning Approach May 7, 2024 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0Improving Offline Reinforcement Learning with Inaccurate Simulators May 7, 2024 D4RL Generative Adversarial Network
— Unverified 0Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows May 6, 2024 Causal Inference counterfactual
— Unverified 0Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints May 5, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics May 4, 2024 continuous-control Continuous Control
Code Code Available 0UDUC: An Uncertainty-driven Approach for Learning-based Robust Control May 4, 2024 Contrastive Learning Model Predictive Control
— Unverified 0Learning Robot Soccer from Egocentric Vision with Deep Reinforcement Learning May 3, 2024 Deep Reinforcement Learning Object Tracking
— Unverified 0Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach May 3, 2024 Q-Learning reinforcement-learning
— Unverified 0A Model-based Multi-Agent Personalized Short-Video Recommender System May 3, 2024 Recommendation Systems Reinforcement Learning (RL)
— Unverified 0Learning Robust Autonomous Navigation and Locomotion for Wheeled-Legged Robots May 3, 2024 Autonomous Navigation Navigate
— Unverified 0Model-based reinforcement learning for protein backbone design May 3, 2024 model Model-based Reinforcement Learning
— Unverified 0Proximal Curriculum with Task Correlations for Deep Reinforcement Learning May 3, 2024 Deep Reinforcement Learning reinforcement-learning
Code Code Available 0