SOTAVerified

scientific discovery

Papers

Showing 150 of 464 papers

TitleStatusHype
KAN 2.0: Kolmogorov-Arnold Networks Meet ScienceCode11
The AI Scientist: Towards Fully Automated Open-Ended Scientific DiscoveryCode11
AgentRxiv: Towards Collaborative Autonomous ResearchCode9
Agent Laboratory: Using LLM Agents as Research AssistantsCode9
AI-Researcher: Autonomous Scientific InnovationCode7
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree SearchCode7
O1 Replication Journey: A Strategic Progress Report -- Part 1Code7
SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoningCode5
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
Improving Parallel Program Performance with LLM Optimizers via Agent-System InterfacesCode4
On the limits of agency in agent-based modelsCode4
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific DiscoveryCode4
Autonomous LLM-driven research from data to human-verifiable research papersCode4
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM ModelCode3
MM-Agent: LLM as Agents for Real-world Mathematical Modeling ProblemCode3
From Automation to Autonomy: A Survey on Large Language Models in Scientific DiscoveryCode3
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge NetworksCode3
Safety at Scale: A Comprehensive Survey of Large Model SafetyCode3
In-situ graph reasoning and knowledge expansion using Graph-PReFLexORCode3
Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP ResearchersCode3
Recent Advances on Machine Learning for Computational Fluid Dynamics: A SurveyCode3
A Review of Large Language Models and Autonomous Agents in ChemistryCode3
DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery AgentsCode3
Accelerating Scientific Discovery with Generative Knowledge Extraction, Graph-Based Representation, and Multimodal Intelligent Graph ReasoningCode3
Scientific Large Language Models: A Survey on Biological & Chemical DomainsCode3
Open Source Planning & Control System with Language Agents for Autonomous Scientific DiscoveryCode2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization AlgorithmsCode2
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science ResearchCode2
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis GenerationCode2
AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical KnowledgeCode2
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational BiologyCode2
Protein Large Language Models: A Comprehensive SurveyCode2
DiffMS: Diffusion Generation of Molecules Conditioned on Mass SpectraCode2
From Generalist to Specialist: A Survey of Large Language Models for ChemistryCode2
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent SystemCode2
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific HypothesesCode2
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific DiscoveryCode2
SciLitLLM: How to Adapt LLMs for Scientific Literature UnderstandingCode2
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AICode2
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal ExamplesCode2
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation ExperimentsCode2
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific DiscoveryCode2
Active Learning with Fully Bayesian Neural Networks for Discontinuous and Nonstationary DataCode2
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language ModelsCode2
Multi-Fidelity Active Learning with GFlowNetsCode2
Ten Quick Tips for Harnessing the Power of ChatGPT/GPT-4 in Computational BiologyCode2
Accelerating Material Design with the Generative Toolkit for Scientific DiscoveryCode2
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling ResearchCode1
ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change QueriesCode1
HSG-12M: A Large-Scale Spatial Multigraph DatasetCode1
Show:102550
← PrevPage 1 of 10Next →

No leaderboard results yet.