SOTAVerified

scientific discovery

Papers

Showing 126–150 of 464 papers

| Title | Status | Hype |
| --- | --- | --- |
| From Street Views to Urban Science: Discovering Road Safety Factors with Multimodal Large Language Models | | 0 |
| OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data | | 0 |
| ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows | | 0 |
| BiomedSQL: Text-to-SQL for Scientific Reasoning on Biomedical Knowledge Bases | Code | 0 |
| MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback | Code | 0 |
| Improving Chemical Understanding of LLMs via SMILES Parsing | | 0 |
| Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models | Code | 0 |
| Robin: A multi-agent system for automating scientific discovery | | 0 |
| InterFeat: An Automated Pipeline for Finding Interesting Hypotheses in Structured Biomedical Data | Code | 0 |
| When AI Co-Scientists Fail: SPOT-a Benchmark for Automated Verification of Scientific Research | Code | 0 |
| On the definition and importance of interpretability in scientific machine learning | | 0 |
| Deep Symbolic Optimization: Reinforcement Learning for Symbolic Mathematics | | 0 |
| Symbol-based entity marker highlighting for enhanced text mining in materials science with generative AI | | 0 |
| Generative Discovery of Partial Differential Equations by Learning from Math Handbooks | | 0 |
| Contributions of the Petabyte Scale Sequence Search Codeathon toward efforts to scale sequence-based searches on SRA | | 0 |
| Soft causal learning for generalized molecule property prediction: An environment perspective | | 0 |
| Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions | | 0 |
| A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law | | 0 |
| 34 Examples of LLM Applications in Materials Science and Chemistry: Towards Automation, Assistants, Agents, and Accelerated Scientific Discovery | | 0 |
| AI Idea Bench 2025: AI Research Idea Generation Benchmark | | 0 |
| Ascribe New Dimensions to Scientific Data Visualization with VR | | 0 |
| Causal-Copilot: An Autonomous Causal Analysis Agent | | 0 |
| Deep literature reviews: an application of fine-tuned language models to migration research | | 0 |
| MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges? | | 0 |
| Scaling Laws of Graph Neural Networks for Atomistic Materials Modeling | | 0 |

No leaderboard results yet.