| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Jul 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Topic Modeling and Link-Prediction for Material Property Discovery | Jul 8, 2025 | Knowledge GraphsLink Prediction | —Unverified | 0 |
| STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking | Jul 4, 2025 | BenchmarkingNavigate | CodeCode Available | 0 |
| Distributed Cross-Channel Hierarchical Aggregation for Foundation Models | Jun 26, 2025 | Computational Efficiencyscientific discovery | —Unverified | 0 |
| Active Inference AI Systems for Scientific Discovery | Jun 26, 2025 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools | Jun 25, 2025 | Continual LearningDomain Generalization | —Unverified | 0 |
| AI Assistants to Enhance and Exploit the PETSc Knowledge Base | Jun 25, 2025 | RAGReranking | —Unverified | 0 |
| From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking | Jun 24, 2025 | Code Generationscientific discovery | CodeCode Available | 0 |
| AutomataGPT: Forecasting and Ruleset Inference for Two-Dimensional Cellular Automata | Jun 19, 2025 | scientific discovery | —Unverified | 0 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Jun 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Graphics4Science: Computer Graphics for Scientific Impacts | Jun 18, 2025 | scientific discovery | —Unverified | 0 |
| An ELIXIR scoping review on domain-specific evaluation metrics for synthetic data in life sciences | Jun 17, 2025 | scientific discoverySynthetic Data Evaluation | —Unverified | 0 |
| Scientifically-Interpretable Reasoning Network (ScIReN): Uncovering the Black-Box of Nature | Jun 16, 2025 | scientific discovery | —Unverified | 0 |
| Evolvable Conditional Diffusion | Jun 16, 2025 | DenoisingDescriptive | —Unverified | 0 |
| Interpretable representation learning of quantum data enabled by probabilistic variational autoencoders | Jun 13, 2025 | Interpretable Machine LearningRepresentation Learning | —Unverified | 0 |
| ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change Queries | Jun 12, 2025 | scientific discovery | CodeCode Available | 1 |
| HSG-12M: A Large-Scale Spatial Multigraph Dataset | Jun 10, 2025 | Graph Learningscientific discovery | CodeCode Available | 1 |
| AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists | Jun 9, 2025 | scientific discoveryvalid | —Unverified | 0 |
| ALINE: Joint Amortization for Bayesian Inference and Active Data Acquisition | Jun 8, 2025 | Active LearningBayesian Inference | CodeCode Available | 0 |
| Can Theoretical Physics Research Benefit from Language Agents? | Jun 6, 2025 | Code GenerationMathematical Reasoning | —Unverified | 0 |
| Unsupervised Machine Learning for Scientific Discovery: Workflow and Best Practices | Jun 5, 2025 | Astronomyscientific discovery | CodeCode Available | 0 |
| Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science | Jun 4, 2025 | ArticlesCode Generation | CodeCode Available | 0 |
| Multi-Exit Kolmogorov-Arnold Networks: enhancing accuracy and parsimony | Jun 3, 2025 | Kolmogorov-Arnold Networksscientific discovery | —Unverified | 0 |
| A Dynamic Framework for Semantic Grouping of Common Data Elements (CDE) Using Embeddings and Clustering | Jun 2, 2025 | Clusteringscientific discovery | —Unverified | 0 |
| From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | Jun 2, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |