Towards Smarter Sensing: 2D Clutter Mitigation in RL-Driven Cognitive MIMO Radar Feb 7, 2025 Integrated sensing and communication Reinforcement Learning (RL)
— Unverified 0Adversarially-Robust TD Learning with Markovian Data: Finite-Time Rates and Fundamental Limits Feb 7, 2025 Adversarial Robustness Reinforcement Learning (RL)
— Unverified 0DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails Feb 7, 2025 Reinforcement Learning (RL) Synthetic Data Generation
Code Code Available 1Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits Feb 7, 2025 Informativeness Offline RL
— Unverified 0Convergent NMPC-based Reinforcement Learning Using Deep Expected Sarsa and Nonlinear Temporal Difference Learning Feb 7, 2025 Reinforcement Learning (RL)
— Unverified 0LLM Alignment as Retriever Optimization: An Information Retrieval Perspective Feb 6, 2025 Information Retrieval Misinformation
— Unverified 0Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning Feb 6, 2025 Dataset Generation MuJoCo
— Unverified 0Mirror Descent Actor Critic via Bounded Advantage Learning Feb 6, 2025 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning Based Prediction of PID Controller Gains for Quadrotor UAVs Feb 6, 2025 Reinforcement Learning (RL)
— Unverified 0Autotelic Reinforcement Learning: Exploring Intrinsic Motivations for Skill Acquisition in Open-Ended Environments Feb 6, 2025 Reinforcement Learning (RL)
— Unverified 0Illuminating Spaces: Deep Reinforcement Learning and Laser-Wall Partitioning for Architectural Layout Generation Feb 6, 2025 Deep Reinforcement Learning Layout Design
— Unverified 0Transforming Multimodal Models into Action Models for Radiotherapy Feb 6, 2025 Anatomy Few-Shot Learning
— Unverified 0Training Language Models to Reason Efficiently Feb 6, 2025 Reinforcement Learning (RL)
Code Code Available 2Demystifying Long Chain-of-Thought Reasoning in LLMs Feb 5, 2025 Reinforcement Learning (RL)
Code Code Available 3CTR-Driven Advertising Image Generation with Multimodal Large Language Models Feb 5, 2025 Image Generation Reinforcement Learning (RL)
Code Code Available 2OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds Feb 5, 2025 Few-Shot Learning Imitation Learning
— Unverified 0Underwater Soft Fin Flapping Motion with Deep Neural Network Based Surrogate Model Feb 5, 2025 Reinforcement Learning (RL)
Code Code Available 0AI-driven materials design: a mini-review Feb 5, 2025 Evolutionary Algorithms Reinforcement Learning (RL)
— Unverified 0Optimizing Electric Vehicles Charging using Large Language Models and Graph Neural Networks Feb 5, 2025 Reinforcement Learning (RL)
— Unverified 0Calibrated Unsupervised Anomaly Detection in Multivariate Time-series using Reinforcement Learning Feb 5, 2025 Anomaly Detection Reinforcement Learning (RL)
— Unverified 0Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs Feb 4, 2025 Code Generation Language Modeling
Code Code Available 2Brief analysis of DeepSeek R1 and it's implications for Generative AI Feb 4, 2025 GPU Mixture-of-Experts
— Unverified 0Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control Feb 4, 2025 Reinforcement Learning (RL)
— Unverified 0Flow Q-Learning Feb 4, 2025 Action Generation D4RL
Code Code Available 3Circular Microalgae-Based Carbon Control for Net Zero Feb 4, 2025 Reinforcement Learning (RL)
Code Code Available 0Analytical Lyapunov Function Discovery: An RL-based Generative Approach Feb 4, 2025 Reinforcement Learning (RL) valid
Code Code Available 1VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play Feb 4, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0RAPID: Robust and Agile Planner Using Inverse Reinforcement Learning for Vision-Based Drone Navigation Feb 4, 2025 Drone navigation Reinforcement Learning (RL)
— Unverified 0The Differences Between Direct Alignment Algorithms are a Blur Feb 3, 2025 Language Modeling Language Modelling
— Unverified 0Preference VLM: Leveraging VLMs for Scalable Preference-Based Reinforcement Learning Feb 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Process Reinforcement through Implicit Rewards Feb 3, 2025 Math Reinforcement Learning (RL)
Code Code Available 5Reinforcement Learning with Segment Feedback Feb 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Reinforcement Learning for Long-Horizon Interactive LLM Agents Feb 3, 2025 reinforcement-learning Reinforcement Learning
— Unverified 0Dynamic object goal pushing with mobile manipulators through model-free constrained reinforcement learning Feb 3, 2025 Friction Object
— Unverified 0ACECODER: Acing Coder RL via Automated Test-Case Synthesis Feb 3, 2025 HumanEval mbpp
— Unverified 0Toward Task Generalization via Memory Augmentation in Meta-Reinforcement Learning Feb 3, 2025 Meta Reinforcement Learning reinforcement-learning
— Unverified 0GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic Environments Feb 3, 2025 Efficient Exploration Graph Neural Network
Code Code Available 1Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning Feb 3, 2025 Meta-Learning Offline RL
— Unverified 0Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer Feb 2, 2025 Reinforcement Learning (RL) Video Generation
— Unverified 0Recursive generalized type-2 fuzzy radial basis function neural networks for joint position estimation and adaptive EMG-based impedance control of lower limb exoskeletons Feb 1, 2025 Electromyography (EMG) GPU
Code Code Available 0Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network Feb 1, 2025 continuous-control Continuous Control
— Unverified 0Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs Feb 1, 2025 Model Predictive Control Reinforcement Learning (RL)
— Unverified 0A Differentiated Reward Method for Reinforcement Learning based Multi-Vehicle Cooperative Decision-Making Algorithms Feb 1, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Optimizing Job Allocation using Reinforcement Learning with Graph Neural Networks Jan 31, 2025 Reinforcement Learning (RL) Scheduling
— Unverified 0Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning Jan 31, 2025 Deep Reinforcement Learning reinforcement-learning
— Unverified 0SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments Jan 31, 2025 reinforcement-learning Reinforcement Learning
Code Code Available 1O-MAPL: Offline Multi-agent Preference Learning Jan 31, 2025 Reinforcement Learning (RL) SMAC
— Unverified 0Towards Physiologically Sensible Predictions via the Rule-based Reinforcement Learning Layer Jan 31, 2025 Reinforcement Learning (RL)
— Unverified 0RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception Jan 31, 2025 Reinforcement Learning (RL) Spatial Reasoning
— Unverified 0Test-Time Training Scaling Laws for Chemical Exploration in Drug Design Jan 31, 2025 Drug Design Drug Discovery
Code Code Available 3