SOTAVerified

scientific discovery

Papers

Showing 91100 of 464 papers

TitleStatusHype
Enabling AI Scientists to Recognize Innovation: A Domain-Agnostic Algorithm for Assessing Novelty0
Can Large Language Models Help Experimental Design for Causal Discovery?0
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational BiologyCode2
CS-PaperSum: A Large-Scale Dataset of AI-Generated Summaries for Scientific Papers0
Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample CreationCode1
Towards an AI co-scientist0
A Perspective on Symbolic Machine Learning in Physical Sciences0
Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs0
Protein Large Language Models: A Comprehensive SurveyCode2
InductionBench: LLMs Fail in the Simplest Complexity ClassCode1
Show:102550
← PrevPage 10 of 47Next →

No leaderboard results yet.