| Better than classical? The subtle art of benchmarking quantum machine learning models | Mar 11, 2024 | BenchmarkingBinary Classification | CodeCode Available | 7 | 5 |
| Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine | Nov 28, 2023 | Electrical EngineeringExperimental Design | CodeCode Available | 5 | 5 |
| NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals | Jul 18, 2024 | Experimental DesignGPU | CodeCode Available | 4 | 5 |
| Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents | Oct 17, 2024 | Experimental Design | CodeCode Available | 4 | 5 |
| Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Sep 6, 2024 | Experimental Designscientific discovery | CodeCode Available | 3 | 5 |
| Predicting from Strings: Language Model Embeddings for Bayesian Optimization | Oct 14, 2024 | Bayesian OptimizationExperimental Design | CodeCode Available | 3 | 5 |
| Attention is not not Explanation | Aug 13, 2019 | Decision MakingDiagnostic | CodeCode Available | 3 | 5 |
| OmniPred: Language Models as Universal Regressors | Feb 22, 2024 | Experimental Designregression | CodeCode Available | 3 | 5 |
| Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System | Oct 12, 2024 | Experimental Designscientific discovery | CodeCode Available | 2 | 5 |
| OpenBox: A Python Toolkit for Generalized Black-box Optimization | Apr 26, 2023 | Experimental Design | CodeCode Available | 2 | 5 |
| BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization | Oct 14, 2019 | Bayesian OptimisationBayesian Optimization | CodeCode Available | 2 | 5 |
| hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices | Mar 9, 2021 | BIG-bench Machine LearningDiagnostic | CodeCode Available | 2 | 5 |
| Probing the limitations of multimodal language models for chemistry and materials research | Nov 25, 2024 | Experimental DesignSpatial Reasoning | CodeCode Available | 2 | 5 |
| Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Feb 4, 2025 | Computational EfficiencyExperimental Design | CodeCode Available | 2 | 5 |
| Honegumi: An Interface for Accelerating the Adoption of Bayesian Optimization in the Experimental Sciences | Feb 4, 2025 | Bayesian OptimizationExperimental Design | CodeCode Available | 2 | 5 |
| The Machine Psychology of Cooperation: Can GPT models operationalise prompts for altruism, cooperation, competitiveness and selfishness in economic games? | May 13, 2023 | Experimental DesignLanguage Modelling | CodeCode Available | 1 | 5 |
| Interventions, Where and How? Experimental Design for Causal Models at Scale | Mar 3, 2022 | Causal DiscoveryExperimental Design | CodeCode Available | 1 | 5 |
| LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text | Feb 6, 2024 | Experimental Design | CodeCode Available | 1 | 5 |
| SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It) | Jun 25, 2024 | BenchmarkingExperimental Design | CodeCode Available | 1 | 5 |
| Conditional Image Generation by Conditioning Variational Auto-Encoders | Feb 24, 2021 | Conditional Image GenerationExperimental Design | CodeCode Available | 1 | 5 |
| Initial recommendations for performing, benchmarking, and reporting single-cell proteomics experiments | Jul 19, 2022 | BenchmarkingExperimental Design | CodeCode Available | 1 | 5 |
| New Paradigms for Exploiting Parallel Experiments in Bayesian Optimization | Oct 3, 2022 | Bayesian OptimizationExperimental Design | CodeCode Available | 1 | 5 |
| A Bayesian Model of Dose-Response for Cancer Drug Studies | Jun 10, 2019 | DenoisingDrug Discovery | CodeCode Available | 1 | 5 |
| ExPT: Synthetic Pretraining for Few-Shot Experimental Design | Oct 30, 2023 | Experimental DesignIn-Context Learning | CodeCode Available | 1 | 5 |
| Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods | Nov 3, 2021 | Experimental Design | CodeCode Available | 1 | 5 |
| Active Task Disambiguation with LLMs | Feb 6, 2025 | Experimental DesignQuestion Selection | CodeCode Available | 1 | 5 |
| Evaluating Multiview Object Consistency in Humans and Image Models | Sep 9, 2024 | Experimental Design | CodeCode Available | 1 | 5 |
| Gemstones: A Model Suite for Multi-Faceted Scaling Laws | Feb 7, 2025 | Experimental DesignLanguage Modeling | CodeCode Available | 1 | 5 |
| Empirical evaluation of scoring functions for Bayesian network model selection | Sep 11, 2012 | Experimental DesignModel Selection | CodeCode Available | 1 | 5 |
| Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics | Mar 21, 2024 | DeepFake DetectionExperimental Design | CodeCode Available | 1 | 5 |
| Edge Proposal Sets for Link Prediction | Jun 30, 2021 | Experimental DesignLink Prediction | CodeCode Available | 1 | 5 |
| Emulation of physical processes with Emukit | Oct 25, 2021 | Bayesian OptimizationDecision Making | CodeCode Available | 1 | 5 |
| GeneDisco: A Benchmark for Experimental Design in Drug Discovery | Oct 22, 2021 | Active LearningDrug Discovery | CodeCode Available | 1 | 5 |
| An Experimental Design Perspective on Model-Based Reinforcement Learning | Dec 9, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Deeper Learning By Doing: Integrating Hands-On Research Projects Into a Machine Learning Course | Jul 28, 2021 | BIG-bench Machine LearningExperimental Design | CodeCode Available | 1 | 5 |
| Creation and analysis of biochemical constraint-based models: the COBRA Toolbox v3.0 | Oct 11, 2017 | Experimental Design | CodeCode Available | 1 | 5 |
| Correct block-design experiments mitigate temporal correlation bias in EEG classification | Nov 25, 2020 | ClassificationEEG | CodeCode Available | 1 | 5 |
| A friendly introduction to triangular transport | Mar 27, 2025 | Bayesian InferenceDecision Making | CodeCode Available | 1 | 5 |
| Autonomous Microscopy Experiments through Large Language Model Agents | Dec 18, 2024 | BenchmarkingExperimental Design | CodeCode Available | 1 | 5 |
| A Practical Recipe for Federated Learning Under Statistical Heterogeneity Experimental Design | Jul 28, 2023 | Experimental DesignFederated Learning | CodeCode Available | 1 | 5 |
| Active Learning for Optimal Intervention Design in Causal Models | Sep 10, 2022 | Active LearningExperimental Design | CodeCode Available | 1 | 5 |
| BINOCULARS for Efficient, Nonmyopic Sequential Experimental Design | Sep 10, 2019 | Bayesian OptimizationExperimental Design | CodeCode Available | 1 | 5 |
| Attention-Based Transformers for Instance Segmentation of Cells in Microstructures | Nov 19, 2020 | Cell DetectionCell Segmentation | CodeCode Available | 1 | 5 |
| Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A Comparative Study | Apr 7, 2020 | AnatomyAnomaly Detection | CodeCode Available | 1 | 5 |
| Experimental design for MRI by greedy policy search | Oct 30, 2020 | Experimental DesignPolicy Gradient Methods | CodeCode Available | 1 | 5 |
| Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design | Mar 3, 2021 | Experimental Design | CodeCode Available | 1 | 5 |
| Deep Local Volatility | Jul 20, 2020 | Deep LearningExperimental Design | CodeCode Available | 1 | 5 |
| Bayesian Algorithm Execution: Estimating Computable Properties of Black-box Functions Using Mutual Information | Apr 19, 2021 | Bayesian OptimizationExperimental Design | CodeCode Available | 1 | 5 |
| CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design | Feb 27, 2023 | Experimental Design | CodeCode Available | 1 | 5 |
| Learning High-Dimensional Parametric Maps via Reduced Basis Adaptive Residual Networks | Dec 14, 2021 | Experimental DesignVocal Bursts Intensity Prediction | CodeCode Available | 1 | 5 |