SOTAVerified

scientific discovery

Papers

Showing 150 of 464 papers

TitleStatusHype
KAN 2.0: Kolmogorov-Arnold Networks Meet ScienceCode11
The AI Scientist: Towards Fully Automated Open-Ended Scientific DiscoveryCode11
Agent Laboratory: Using LLM Agents as Research AssistantsCode9
AgentRxiv: Towards Collaborative Autonomous ResearchCode9
AI-Researcher: Autonomous Scientific InnovationCode7
O1 Replication Journey: A Strategic Progress Report -- Part 1Code7
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree SearchCode7
SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoningCode5
Improving Parallel Program Performance with LLM Optimizers via Agent-System InterfacesCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific DiscoveryCode4
Autonomous LLM-driven research from data to human-verifiable research papersCode4
On the limits of agency in agent-based modelsCode4
In-situ graph reasoning and knowledge expansion using Graph-PReFLexORCode3
Scientific Large Language Models: A Survey on Biological & Chemical DomainsCode3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP ResearchersCode3
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge NetworksCode3
A Review of Large Language Models and Autonomous Agents in ChemistryCode3
MM-Agent: LLM as Agents for Real-world Mathematical Modeling ProblemCode3
Recent Advances on Machine Learning for Computational Fluid Dynamics: A SurveyCode3
Safety at Scale: A Comprehensive Survey of Large Model SafetyCode3
Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph ReasoningCode3
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery AgentsCode3
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM ModelCode3
From Automation to Autonomy: A Survey on Large Language Models in Scientific DiscoveryCode3
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent SystemCode2
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis GenerationCode2
Accelerating Material Design with the Generative Toolkit for Scientific DiscoveryCode2
SciLitLLM: How to Adapt LLMs for Scientific Literature UnderstandingCode2
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language ModelsCode2
Ten Quick Tips for Harnessing the Power of ChatGPT/GPT-4 in Computational BiologyCode2
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational BiologyCode2
Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary DataCode2
AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical KnowledgeCode2
Protein Large Language Models: A Comprehensive SurveyCode2
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal ExamplesCode2
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation ExperimentsCode2
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science ResearchCode2
From Generalist to Specialist: A Survey of Large Language Models for ChemistryCode2
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AICode2
Open Source Planning & Control System with Language Agents for Autonomous Scientific DiscoveryCode2
DiffMS: Diffusion Generation of Molecules Conditioned on Mass SpectraCode2
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific HypothesesCode2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization AlgorithmsCode2
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific DiscoveryCode2
Multi-Fidelity Active Learning with GFlowNetsCode2
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific DiscoveryCode2
Constructing Custom Thermodynamics Using Deep LearningCode1
ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change QueriesCode1
InductionBench: LLMs Fail in the Simplest Complexity ClassCode1
Show:102550
← PrevPage 1 of 10Next →

No leaderboard results yet.