SOTAVerified

scientific discovery

Papers

Showing 76100 of 464 papers

TitleStatusHype
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition0
Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees0
SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings0
Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations0
AgentRxiv: Towards Collaborative Autonomous ResearchCode9
Offline Model-Based Optimization: Comprehensive ReviewCode1
CodeScientist: End-to-End Semi-Automated Scientific Discovery with Code-based Experimentation0
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific ResearchCode1
Lessons from the trenches on evaluating machine-learning systems in materials science0
SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models0
Representation Retrieval Learning for Heterogeneous Data Integration0
Agentic AI for Scientific Discovery: A Survey of Progress, Challenges, and Future Directions0
Accelerating Earth Science Discovery via Multi-Agent LLM Systems0
Large Language Models for Zero-shot Inference of Causal Structures in Biology0
Building Machine Learning Challenges for Anomaly Detection in Science0
Enabling AI Scientists to Recognize Innovation: A Domain-Agnostic Algorithm for Assessing Novelty0
Can Large Language Models Help Experimental Design for Causal Discovery?0
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational BiologyCode2
CS-PaperSum: A Large-Scale Dataset of AI-Generated Summaries for Scientific Papers0
Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample CreationCode1
Towards an AI co-scientist0
A Perspective on Symbolic Machine Learning in Physical Sciences0
Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs0
Protein Large Language Models: A Comprehensive SurveyCode2
InductionBench: LLMs Fail in the Simplest Complexity ClassCode1
Show:102550
← PrevPage 4 of 19Next →

No leaderboard results yet.