| Can Theoretical Physics Research Benefit from Language Agents? | Jun 6, 2025 | Code Generation, Mathematical Reasoning | Unverified | 0 |
| Unsupervised Machine Learning for Scientific Discovery: Workflow and Best Practices | Jun 5, 2025 | Astronomy, scientific discovery | Code Available | 0 |
| Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science | Jun 4, 2025 | Articles, Code Generation | Code Available | 0 |
| Multi-Exit Kolmogorov-Arnold Networks: enhancing accuracy and parsimony | Jun 3, 2025 | Kolmogorov-Arnold Networks, scientific discovery | Unverified | 0 |
| A Dynamic Framework for Semantic Grouping of Common Data Elements (CDE) Using Embeddings and Clustering | Jun 2, 2025 | Clustering, scientific discovery | Unverified | 0 |
| From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | Jun 2, 2025 | Large Language Model, Multimodal Large Language Model | Unverified | 0 |
| OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data | May 29, 2025 | scientific discovery | Unverified | 0 |
| ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows | May 26, 2025 | Astronomy, scientific discovery | Unverified | 0 |
| MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback | May 23, 2025 | scientific discovery | Code Available | 0 |
| BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases | May 23, 2025 | Causal Inference, scientific discovery | Code Available | 0 |