| From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | Jun 2, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data | May 29, 2025 | scientific discovery | —Unverified | 0 |
| ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows | May 26, 2025 | Astronomyscientific discovery | —Unverified | 0 |
| BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases | May 23, 2025 | Causal Inferencescientific discovery | CodeCode Available | 0 |
| MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback | May 23, 2025 | scientific discovery | CodeCode Available | 0 |
| Improving Chemical Understanding of LLMs via SMILES Parsing | May 22, 2025 | Graph Matchingscientific discovery | —Unverified | 0 |
| Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models | May 20, 2025 | Hallucinationscientific discovery | CodeCode Available | 0 |
| Robin: A multi-agent system for automating scientific discovery | May 19, 2025 | scientific discovery | —Unverified | 0 |
| InterFeat: An Automated Pipeline for Finding Interesting Hypotheses in Structured Biomedical Data | May 18, 2025 | Knowledge Graphsscientific discovery | CodeCode Available | 0 |
| When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research | May 17, 2025 | Misconceptionsscientific discovery | CodeCode Available | 0 |
| On the definition and importance of interpretability in scientific machine learning | May 16, 2025 | Equation DiscoveryInterpretable Machine Learning | —Unverified | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | May 16, 2025 | Equation Discoveryreinforcement-learning | —Unverified | 0 |
| Symbol-based entity marker highlighting for enhanced text mining in materials science with generative AI | May 9, 2025 | NERscientific discovery | —Unverified | 0 |
| Generative Discovery of Partial Differential Equations by Learning from Math Handbooks | May 9, 2025 | Computational EfficiencyMath | —Unverified | 0 |
| Contributions of the Petabyte Scale Sequence Search Codeathon toward efforts to scale sequence-based searches on SRA | May 9, 2025 | Benchmarkingscientific discovery | —Unverified | 0 |
| Soft causal learning for generalized molecule property prediction: An environment perspective | May 7, 2025 | Graph LearningProperty Prediction | —Unverified | 0 |
| Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions | May 6, 2025 | Causal InferenceDomain Adaptation | —Unverified | 0 |
| A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law | May 5, 2025 | MathMedical Diagnosis | —Unverified | 0 |
| 34 Examples of LLM Applications in Materials Science and Chemistry: Towards Automation, Assistants, Agents, and Accelerated Scientific Discovery | May 5, 2025 | Large Language ModelMolecular Property Prediction | —Unverified | 0 |
| AI Idea Bench 2025: AI Research Idea Generation Benchmark | Apr 19, 2025 | Benchmarkingscientific discovery | —Unverified | 0 |
| Ascribe New Dimensions to Scientific Data Visualization with VR | Apr 18, 2025 | Data Visualizationscientific discovery | —Unverified | 0 |
| Causal-Copilot: An Autonomous Causal Analysis Agent | Apr 17, 2025 | Causal DiscoveryCausal Inference | —Unverified | 0 |
| Deep literature reviews: an application of fine-tuned language models to migration research | Apr 17, 2025 | Articlesscientific discovery | —Unverified | 0 |
| MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges? | Apr 13, 2025 | Large Language Modelscientific discovery | —Unverified | 0 |
| Scaling Laws of Graph Neural Networks for Atomistic Materials Modeling | Apr 10, 2025 | Drug Discoveryscientific discovery | —Unverified | 0 |