SOTAVerified

Experimental Design

Papers

Showing 1-50 of 688 papers

Title | Status | Hype
Better than classical? The subtle art of benchmarking quantum machine learning models | Code | 7
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine | Code | 5
Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents | Code | 4
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals | Code | 4
Predicting from Strings: Language Model Embeddings for Bayesian Optimization | Code | 3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers | Code | 3
OmniPred: Language Models as Universal Regressors | Code | 3
Attention is not not Explanation | Code | 3
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment | Code | 2
Honegumi: An Interface for Accelerating the Adoption of Bayesian Optimization in the Experimental Sciences | Code | 2
Probing the limitations of multimodal language models for chemistry and materials research | Code | 2
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System | Code | 2
OpenBox: A Python Toolkit for Generalized Black-box Optimization | Code | 2
hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices | Code | 2
BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization | Code | 2
A friendly introduction to triangular transport | Code | 1
Gemstones: A Model Suite for Multi-Faceted Scaling Laws | Code | 1
Active Task Disambiguation with LLMs | Code | 1
Autonomous Microscopy Experiments through Large Language Model Agents | Code | 1
Confident Teacher, Confident Student? A Novel User Study Design for Investigating the Didactic Potential of Explanations and their Impact on Uncertainty | Code | 1
Evaluating Multiview Object Consistency in Humans and Image Models | Code | 1
Toward Automated Simulation Research Workflow through LLM Prompt Engineering Design | Code | 1
GitHub is an effective platform for collaborative and reproducible laboratory research | Code | 1
SoK: Membership Inference Attacks on LLMs are Rushing Nowhere (and How to Fix It) | Code | 1
Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics | Code | 1
LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text | Code | 1
Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning | Code | 1
ExPT: Synthetic Pretraining for Few-Shot Experimental Design | Code | 1
Sustainable Concrete via Bayesian Optimization | Code | 1
A Practical Recipe for Federated Learning Under Statistical Heterogeneity Experimental Design | Code | 1
CeBed: A Benchmark for Deep Data-Driven OFDM Channel Estimation | Code | 1
The Machine Psychology of Cooperation: Can GPT models operationalise prompts for altruism, cooperation, competitiveness and selfishness in economic games? | Code | 1
CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design | Code | 1
Comparing Well and Geophysical Data for Temperature Monitoring Within a Bayesian Experimental Design Framework | Code | 1
New Paradigms for Exploiting Parallel Experiments in Bayesian Optimization | Code | 1
Active Learning for Optimal Intervention Design in Causal Models | Code | 1
Initial recommendations for performing, benchmarking, and reporting single-cell proteomics experiments | Code | 1
Derivative-Informed Neural Operator: An Efficient Framework for High-Dimensional Parametric Derivative Learning | Code | 1
Marginal Post Processing of Bayesian Inference Products with Normalizing Flows and Kernel Density Estimators | Code | 1
VICE: Variational Interpretable Concept Embeddings | Code | 1
Interventions, Where and How? Experimental Design for Causal Models at Scale | Code | 1
Optimizing Sequential Experimental Design with Deep Reinforcement Learning | Code | 1
Learning High-Dimensional Parametric Maps via Reduced Basis Adaptive Residual Networks | Code | 1
An Experimental Design Perspective on Model-Based Reinforcement Learning | Code | 1
Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods | Code | 1
Emulation of physical processes with Emukit | Code | 1
GeneDisco: A Benchmark for Experimental Design in Drug Discovery | Code | 1
What Does TERRA-REF's High Resolution, Multi Sensor Plant Sensing Public Domain Data Offer the Computer Vision Community? | Code | 1
Deeper Learning By Doing: Integrating Hands-On Research Projects Into a Machine Learning Course | Code | 1
Edge Proposal Sets for Link Prediction | Code | 1

No leaderboard results yet.