| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Jul 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Topic Modeling and Link-Prediction for Material Property Discovery | Jul 8, 2025 | Knowledge GraphsLink Prediction | —Unverified | 0 |
| STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking | Jul 4, 2025 | BenchmarkingNavigate | CodeCode Available | 0 |
| Distributed Cross-Channel Hierarchical Aggregation for Foundation Models | Jun 26, 2025 | Computational Efficiencyscientific discovery | —Unverified | 0 |
| Active Inference AI Systems for Scientific Discovery | Jun 26, 2025 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools | Jun 25, 2025 | Continual LearningDomain Generalization | —Unverified | 0 |
| AI Assistants to Enhance and Exploit the PETSc Knowledge Base | Jun 25, 2025 | RAGReranking | —Unverified | 0 |
| From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking | Jun 24, 2025 | Code Generationscientific discovery | CodeCode Available | 0 |
| AutomataGPT: Forecasting and Ruleset Inference for Two-Dimensional Cellular Automata | Jun 19, 2025 | scientific discovery | —Unverified | 0 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Jun 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |