| SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents | May 29, 2025 | Adversarial Attack, Large Language Model | Code Available | 1 |
| MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research | May 26, 2025 | scientific discovery | Code Available | 1 |
| PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration | May 21, 2025 | Large Language Model, scientific discovery | Code Available | 1 |
| Benchmarking AI scientists in omics data-driven biological research | May 13, 2025 | Benchmarking, Multiple-choice | Code Available | 1 |
| IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery | Apr 23, 2025 | scientific discovery | Code Available | 1 |
| The AI Cosmologist I: An Agentic System for Automated Data Analysis | Apr 4, 2025 | scientific discovery | Code Available | 1 |
| Offline Model-Based Optimization: Comprehensive Review | Mar 21, 2025 | model, Neural Architecture Search | Code Available | 1 |
| MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research | Mar 17, 2025 | Articles, Benchmarking | Code Available | 1 |
| Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation | Feb 26, 2025 | Ingenuity, scientific discovery | Code Available | 1 |
| InductionBench: LLMs Fail in the Simplest Complexity Class | Feb 20, 2025 | scientific discovery | Code Available | 1 |
| K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction | Feb 18, 2025 | Drug Discovery, Knowledge Graphs | Code Available | 1 |
| Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation | Feb 7, 2025 | scientific discovery, Survey | Code Available | 1 |
| AIGS: Generating Science from AI-Powered Automated Falsification | Nov 17, 2024 | scientific discovery | Code Available | 1 |
| Geometric Representation Condition Improves Equivariant Molecule Generation | Oct 4, 2024 | Drug Design, scientific discovery | Code Available | 1 |
| BLADE: Benchmarking Language Model Agents for Data-Driven Science | Aug 19, 2024 | Benchmarking, Decision Making | Code Available | 1 |
| Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation | Jul 12, 2024 | Few-Shot Learning, scientific discovery | Code Available | 1 |
| LLM-SR: Scientific Equation Discovery via Programming with Large Language Models | Apr 29, 2024 | Equation Discovery, Interpretable Machine Learning | Code Available | 1 |
| GraphGPT: Graph Learning with Generative Pre-trained Transformers | Dec 31, 2023 | Decoder, Graph Learning | Code Available | 1 |
| Deep Generative Symbolic Regression | Dec 30, 2023 | Form, Heuristic Search | Code Available | 1 |
| A Transformer Model for Symbolic Regression towards Scientific Discovery | Dec 7, 2023 | regression, scientific discovery | Code Available | 1 |
| Machine-Guided Discovery of a Real-World Rogue Wave Model | Nov 21, 2023 | Model Selection, regression | Code Available | 1 |
| Large Language Models are Zero Shot Hypothesis Proposers | Nov 10, 2023 | scientific discovery | Code Available | 1 |
| Modelling Cellular Perturbations with the Sparse Additive Mechanism Shift Variational Autoencoder | Nov 5, 2023 | Disentanglement, Drug Discovery | Code Available | 1 |
| Large Language Models for Scientific Synthesis, Inference and Explanation | Oct 12, 2023 | Code Generation, Language Modeling | Code Available | 1 |
| Evolving Scientific Discovery by Unifying Data and Background Knowledge with AI Hilbert | Aug 18, 2023 | Equation Discovery, Logical Reasoning | Code Available | 1 |
| Towards Lightweight Data Integration using Multi-workflow Provenance and Data Observability | Aug 17, 2023 | CPU, Data Integration | Code Available | 1 |
| Constructing Custom Thermodynamics Using Deep Learning | Aug 8, 2023 | Deep Learning, Physical Intuition | Code Available | 1 |
| Symmetry-Informed Geometric Representation for Molecules, Proteins, and Crystalline Materials | Jun 15, 2023 | Benchmarking, Computational chemistry | Code Available | 1 |
| X-TIME: An in-memory engine for accelerating machine learning on tabular data with CAMs | Apr 3, 2023 | GPU, scientific discovery | Code Available | 1 |
| Applications of Gaussian Processes at Extreme Lengthscales: From Molecules to Black Holes | Mar 24, 2023 | Bayesian Optimisation, Gaussian Processes | Code Available | 1 |
| Unifying Molecular and Textual Representations via Multi-task Language Modelling | Jan 29, 2023 | Language Modelling, Molecule Captioning | Code Available | 1 |
| Scalable Hybrid Learning Techniques for Scientific Data Compression | Dec 21, 2022 | Data Compression, GPU | Code Available | 1 |
| SRSD: Rethinking Datasets of Symbolic Regression for Scientific Discovery | Dec 2, 2022 | regression, scientific discovery | Code Available | 1 |
| Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection | Nov 1, 2022 | Anomaly Detection, Domain Adaptation | Code Available | 1 |
| Selection by Prediction with Conformal p-values | Oct 4, 2022 | Decision Making, Drug Discovery | Code Available | 1 |
| Explaining Patterns in Data with Language Models via Interpretable Autoprompting | Oct 4, 2022 | Explanation Generation, Natural Language Understanding | Code Available | 1 |
| Quantitative probing: Validating causal models using quantitative domain knowledge | Sep 7, 2022 | scientific discovery | Code Available | 1 |
| MLExchange: A web-based platform enabling exchangeable machine learning workflows for scientific studies | Aug 20, 2022 | scientific discovery | Code Available | 1 |
| Deep Learning and Symbolic Regression for Discovering Parametric Equations | Jul 1, 2022 | BIG-bench Machine Learning, Deep Learning | Code Available | 1 |
| Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery | Jun 21, 2022 | regression, scientific discovery | Code Available | 1 |
| AutoML Two-Sample Test | Jun 17, 2022 | AutoML, scientific discovery | Code Available | 1 |
| Going From Molecules to Genomic Variations to Scientific Discovery: Intelligent Algorithms and Architectures for Intelligent Genome Analysis | May 16, 2022 | scientific discovery | Code Available | 1 |
| Bayesian optimization with known experimental and design constraints for chemistry applications | Mar 29, 2022 | Bayesian Optimization, scientific discovery | Code Available | 1 |
| AbductionRules: Training Transformers to Explain Unexpected Inputs | Mar 23, 2022 | Common Sense Reasoning, Logical Reasoning | Code Available | 1 |
| Learning quantum dynamics with latent neural ODEs | Oct 20, 2021 | scientific discovery | Code Available | 1 |
| Molformer: Motif-based Transformer on 3D Heterogeneous Molecular Graphs | Oct 4, 2021 | 3D geometry, graph construction | Code Available | 1 |
| AI Descartes: Combining Data and Theory for Derivable Scientific Discovery | Sep 3, 2021 | Automated Theorem Proving, BIG-bench Machine Learning | Code Available | 1 |
| Neural Symbolic Regression that Scales | Jun 11, 2021 | regression, scientific discovery | Code Available | 1 |
| Data-driven discovery of Green's functions with human-understandable deep learning | May 1, 2021 | scientific discovery | Code Available | 1 |
| Gemini: Dynamic Bias Correction for Autonomous Experimentation and Molecular Simulation | Mar 5, 2021 | Bayesian Optimization, regression | Code Available | 1 |