MOORL: A Framework for Integrating Offline-Online Reinforcement Learning Jun 11, 2025 D4RL Deep Reinforcement Learning
— Unverified 0Go-Browse: Training Web Agents with Structured Exploration Jun 4, 2025 Efficient Exploration Language Modeling
— Unverified 0DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience Jun 4, 2025 Efficient Exploration Equation Discovery
— Unverified 0WoMAP: World Models For Embodied Open-Vocabulary Object Localization Jun 2, 2025 Active Object Localization Efficient Exploration
— Unverified 0MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming May 29, 2025 Diversity Efficient Exploration
Code Code Available 2HelixDesign-Binder: A Scalable Production-Grade Platform for Binder Design Built on HelixFold3 May 28, 2025 Benchmarking Efficient Exploration
— Unverified 0DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning May 26, 2025 Efficient Exploration reinforcement-learning
Code Code Available 0STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs May 21, 2025 Efficient Exploration Reinforcement Learning (RL)
Code Code Available 0Comparative Analysis of Black-Box Optimization Methods for Weather Intervention Design May 16, 2025 Bayesian Optimization Efficient Exploration
— Unverified 0IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning May 15, 2025 Efficient Exploration Imitation Learning
Code Code Available 0Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? May 14, 2025 Efficient Exploration
— Unverified 0Distilling Realizable Students from Unrealizable Teachers May 14, 2025 Efficient Exploration Imitation Learning
— Unverified 0Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning May 13, 2025 Efficient Exploration Multi-agent Reinforcement Learning
— Unverified 0Interpretable SHAP-bounded Bayesian Optimization for Underwater Acoustic Metamaterial Coating Design May 10, 2025 Bayesian Optimization Efficient Exploration
— Unverified 0An Explainable Nature-Inspired Framework for Monkeypox Diagnosis: Xception Features Combined with NGBoost and African Vultures Optimization Algorithm Apr 24, 2025 Diagnostic Dimensionality Reduction
— Unverified 0ForesightNav: Learning Scene Imagination for Efficient Exploration Apr 22, 2025 Efficient Exploration Navigate
Code Code Available 2Aerial Active STAR-RIS-assisted Satellite-Terrestrial Covert Communications Apr 22, 2025 Deep Reinforcement Learning Denoising
— Unverified 0Lumos: Efficient Performance Modeling and Estimation for Large-scale LLM Training Apr 12, 2025 Efficient Exploration
— Unverified 0Memetic Search for Green Vehicle Routing Problem with Private Capacitated Refueling Stations Apr 6, 2025 Efficient Exploration
— Unverified 0From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making Apr 5, 2025 Bayesian Optimization Data Integration
— Unverified 0Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning Mar 28, 2025 Efficient Exploration Language Modeling
— Unverified 0Maya: Optimizing Deep Learning Training Workloads using Emulated Virtual Accelerators Mar 26, 2025 Deep Learning Efficient Exploration
— Unverified 0FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs Mar 25, 2025 Efficient Exploration Information Retrieval
— Unverified 0CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning Mar 23, 2025 Deep Reinforcement Learning Efficient Exploration
— Unverified 0KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies Mar 23, 2025 continuous-control Continuous Control
— Unverified 0Disentangling Uncertainties by Learning Compressed Data Representation Mar 20, 2025 Efficient Exploration Gaussian Processes
Code Code Available 0Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model Mar 14, 2025 Bayesian Inference Efficient Exploration
— Unverified 0HyperArm Bandit Optimization: A Novel approach to Hyperparameter Optimization and an Analysis of Bandit Algorithms in Stochastic and Adversarial Settings Mar 13, 2025 Bayesian Optimization Computational Efficiency
— Unverified 0Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration Mar 10, 2025 Efficient Exploration
— Unverified 0Reward-Centered ReST-MCTS: A Robust Decision-Making Framework for Robotic Manipulation in High Uncertainty Environments Mar 7, 2025 Decision Making Efficient Exploration
Code Code Available 0Probabilistic Insights for Efficient Exploration Strategies in Reinforcement Learning Mar 5, 2025 Diversity Efficient Exploration
— Unverified 0A Transformer Model for Predicting Chemical Reaction Products from Generic Templates Mar 4, 2025 Computational chemistry Efficient Exploration
— Unverified 0Contextualizing biological perturbation experiments through language Feb 28, 2025 Efficient Exploration
Code Code Available 1Training a Generally Curious Agent Feb 24, 2025 Decision Making Efficient Exploration
Code Code Available 1On Space-Filling Input Design for Nonlinear Dynamic Model Learning: A Gaussian Process Approach Feb 24, 2025 Efficient Exploration
— Unverified 0Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery Feb 20, 2025 Efficient Exploration Transfer Learning
— Unverified 0Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation Feb 20, 2025 Decision Making Efficient Exploration
— Unverified 0FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching Feb 19, 2025 Diversity Drug Discovery
— Unverified 0DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models Feb 19, 2025 Diversity Efficient Exploration
— Unverified 0Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts Feb 18, 2025 Efficient Exploration
— Unverified 0Maximum Entropy Reinforcement Learning with Diffusion Policy Feb 17, 2025 Efficient Exploration MuJoCo
Code Code Available 1Massively Scaling Explicit Policy-conditioned Value Functions Feb 17, 2025 continuous-control Continuous Control
— Unverified 0Causal Information Prioritization for Efficient Reinforcement Learning Feb 14, 2025 continuous-control Continuous Control
— Unverified 0Exploratory Diffusion Model for Unsupervised Reinforcement Learning Feb 11, 2025 Efficient Exploration model
— Unverified 0Guided Exploration for Efficient Relational Model Learning Feb 10, 2025 Efficient Exploration model
— Unverified 0Few-shot_LLM_Synthetic_Data_with_Distribution_Matching Feb 9, 2025 Attribute Efficient Exploration
Code Code Available 0Adaptive Exploration for Multi-Reward Multi-Policy Evaluation Feb 4, 2025 Efficient Exploration
— Unverified 0GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic Environments Feb 3, 2025 Efficient Exploration Graph Neural Network
Code Code Available 1Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning Jan 29, 2025 continuous-control Continuous Control
Code Code Available 1Constrained Hybrid Metaheuristic Algorithm for Probabilistic Neural Networks Learning Jan 26, 2025 Efficient Exploration
— Unverified 0