SOTAVerified

scientific discovery

Papers

Showing 5160 of 464 papers

TitleStatusHype
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM AgentsCode1
MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning ResearchCode1
PiFlow: Principle-aware Scientific Discovery with Multi-Agent CollaborationCode1
Benchmarking AI scientists in omics data-driven biological researchCode1
IRIS: Interactive Research Ideation System for Accelerating Scientific DiscoveryCode1
The AI Cosmologist I: An Agentic System for Automated Data AnalysisCode1
Offline Model-Based Optimization: Comprehensive ReviewCode1
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific ResearchCode1
Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample CreationCode1
InductionBench: LLMs Fail in the Simplest Complexity ClassCode1
Show:102550
← PrevPage 6 of 47Next →

No leaderboard results yet.