| Better than classical? The subtle art of benchmarking quantum machine learning models | Mar 11, 2024 | BenchmarkingBinary Classification | CodeCode Available | 7 | 5 |
| Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine | Nov 28, 2023 | Electrical EngineeringExperimental Design | CodeCode Available | 5 | 5 |
| Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents | Oct 17, 2024 | Experimental Design | CodeCode Available | 4 | 5 |
| NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals | Jul 18, 2024 | Experimental DesignGPU | CodeCode Available | 4 | 5 |
| Predicting from Strings: Language Model Embeddings for Bayesian Optimization | Oct 14, 2024 | Bayesian OptimizationExperimental Design | CodeCode Available | 3 | 5 |
| OmniPred: Language Models as Universal Regressors | Feb 22, 2024 | Experimental Designregression | CodeCode Available | 3 | 5 |
| Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Sep 6, 2024 | Experimental Designscientific discovery | CodeCode Available | 3 | 5 |
| Attention is not not Explanation | Aug 13, 2019 | Decision MakingDiagnostic | CodeCode Available | 3 | 5 |
| Probing the limitations of multimodal language models for chemistry and materials research | Nov 25, 2024 | Experimental DesignSpatial Reasoning | CodeCode Available | 2 | 5 |
| Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System | Oct 12, 2024 | Experimental Designscientific discovery | CodeCode Available | 2 | 5 |
| BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization | Oct 14, 2019 | Bayesian OptimisationBayesian Optimization | CodeCode Available | 2 | 5 |
| Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Feb 4, 2025 | Computational EfficiencyExperimental Design | CodeCode Available | 2 | 5 |
| OpenBox: A Python Toolkit for Generalized Black-box Optimization | Apr 26, 2023 | Experimental Design | CodeCode Available | 2 | 5 |
| hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices | Mar 9, 2021 | BIG-bench Machine LearningDiagnostic | CodeCode Available | 2 | 5 |
| Honegumi: An Interface for Accelerating the Adoption of Bayesian Optimization in the Experimental Sciences | Feb 4, 2025 | Bayesian OptimizationExperimental Design | CodeCode Available | 2 | 5 |
| Comparing Well and Geophysical Data for Temperature Monitoring Within a Bayesian Experimental Design Framework | Oct 19, 2022 | Experimental DesignTime Series Regression | CodeCode Available | 1 | 5 |
| CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design | Feb 27, 2023 | Experimental Design | CodeCode Available | 1 | 5 |
| Confident Teacher, Confident Student? A Novel User Study Design for Investigating the Didactic Potential of Explanations and their Impact on Uncertainty | Sep 10, 2024 | Experimental DesignExplainable artificial intelligence | CodeCode Available | 1 | 5 |
| CeBed: A Benchmark for Deep Data-Driven OFDM Channel Estimation | Jun 23, 2023 | Experimental Design | CodeCode Available | 1 | 5 |
| An Experimental Design Perspective on Model-Based Reinforcement Learning | Dec 9, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Correct block-design experiments mitigate temporal correlation bias in EEG classification | Nov 25, 2020 | ClassificationEEG | CodeCode Available | 1 | 5 |
| A Bayesian Model of Dose-Response for Cancer Drug Studies | Jun 10, 2019 | DenoisingDrug Discovery | CodeCode Available | 1 | 5 |
| Bayesian Algorithm Execution: Estimating Computable Properties of Black-box Functions Using Mutual Information | Apr 19, 2021 | Bayesian OptimizationExperimental Design | CodeCode Available | 1 | 5 |
| Active Learning for Optimal Intervention Design in Causal Models | Sep 10, 2022 | Active LearningExperimental Design | CodeCode Available | 1 | 5 |
| Autoencoders for Unsupervised Anomaly Segmentation in Brain MR Images: A Comparative Study | Apr 7, 2020 | AnatomyAnomaly Detection | CodeCode Available | 1 | 5 |