SOTAVerified

scientific discovery

Papers

Showing 26–50 of 464 papers

| Title | Status | Hype |
| --- | --- | --- |
| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Code | 2 |
| LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms | Code | 2 |
| AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research | Code | 2 |
| HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation | Code | 2 |
| AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical Knowledge | Code | 2 |
| BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational Biology | Code | 2 |
| Protein Large Language Models: A Comprehensive Survey | Code | 2 |
| DiffMS: Diffusion Generation of Molecules Conditioned on Mass Spectra | Code | 2 |
| From Generalist to Specialist: A Survey of Large Language Models for Chemistry | Code | 2 |
| Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System | Code | 2 |
| MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses | Code | 2 |
| ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery | Code | 2 |
| SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding | Code | 2 |
| OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI | Code | 2 |
| Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples | Code | 2 |
| BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments | Code | 2 |
| LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery | Code | 2 |
| Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary Data | Code | 2 |
| SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models | Code | 2 |
| Multi-Fidelity Active Learning with GFlowNets | Code | 2 |
| Ten Quick Tips for Harnessing the Power of ChatGPT/GPT-4 in Computational Biology | Code | 2 |
| Accelerating Material Design with the Generative Toolkit for Scientific Discovery | Code | 2 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Code | 1 |
| ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change Queries | Code | 1 |
| HSG-12M: A Large-Scale Spatial Multigraph Dataset | Code | 1 |

No leaderboard results yet.