SOTAVerified

scientific discovery

Papers

Showing 1-50 of 464 papers

Title | Status | Hype
Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Code | 2
Topic Modeling and Link-Prediction for Material Property Discovery | - | 0
STRUCTSENSE: A Task-Agnostic Agentic Framework for Structured Information Extraction with Human-In-The-Loop Evaluation and Benchmarking | Code | 0
Distributed Cross-Channel Hierarchical Aggregation for Foundation Models | - | 0
Active Inference AI Systems for Scientific Discovery | - | 0
A Survey of AI for Materials Science: Foundation Models, LLM Agents, Datasets, and Tools | - | 0
AI Assistants to Enhance and Exploit the PETSc Knowledge Base | - | 0
From Reproduction to Replication: Evaluating Research Agents with Progressive Code Masking | Code | 0
AutomataGPT: Forecasting and Ruleset Inference for Two-Dimensional Cellular Automata | - | 0
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Code | 1
Graphics4Science: Computer Graphics for Scientific Impacts | - | 0
An ELIXIR scoping review on domain-specific evaluation metrics for synthetic data in life sciences | - | 0
Evolvable Conditional Diffusion | - | 0
Scientifically-Interpretable Reasoning Network (ScIReN): Uncovering the Black-Box of Nature | - | 0
Interpretable representation learning of quantum data enabled by probabilistic variational autoencoders | - | 0
ClimateChat: Designing Data and Methods for Instruction Tuning LLMs to Answer Climate Change Queries | Code | 1
HSG-12M: A Large-Scale Spatial Multigraph Dataset | Code | 1
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists | - | 0
ALINE: Joint Amortization for Bayesian Inference and Active Data Acquisition | Code | 0
Can Theoretical Physics Research Benefit from Language Agents? | - | 0
Unsupervised Machine Learning for Scientific Discovery: Workflow and Best Practices | Code | 0
Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science | Code | 0
Multi-Exit Kolmogorov-Arnold Networks: enhancing accuracy and parsimony | - | 0
A Dynamic Framework for Semantic Grouping of Common Data Elements (CDE) Using Embeddings and Clustering | - | 0
From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | - | 0
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents | Code | 1
OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data | - | 0
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | Code | 3
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms | Code | 2
MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research | Code | 1
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows | - | 0
AI-Researcher: Autonomous Scientific Innovation | Code | 7
BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases | Code | 0
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback | Code | 0
Improving Chemical Understanding of LLMs via SMILES Parsing | - | 0
PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration | Code | 1
MM-Agent: LLM as Agents for Real-world Mathematical Modeling Problem | Code | 3
Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models | Code | 0
From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery | Code | 3
Robin: A multi-agent system for automating scientific discovery | Code | 0
InterFeat: An Automated Pipeline for Finding Interesting Hypotheses in Structured Biomedical Data | Code | 0
When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research | Code | 0
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research | Code | 2
On the definition and importance of interpretability in scientific machine learning | - | 0
Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | - | 0
Benchmarking AI scientists in omics data-driven biological research | Code | 1
Contributions of the Petabyte Scale Sequence Search Codeathon toward efforts to scale sequence-based searches on SRA | - | 0
Symbol-based entity marker highlighting for enhanced text mining in materials science with generative AI | - | 0
Generative Discovery of Partial Differential Equations by Learning from Math Handbooks | - | 0
Soft causal learning for generalized molecule property prediction: An environment perspective | - | 0

No leaderboard results yet.