SOTAVerified

scientific discovery

Papers

Showing 51100 of 464 papers

TitleStatusHype
Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions0
34 Examples of LLM Applications in Materials Science and Chemistry: Towards Automation, Assistants, Agents, and Accelerated Scientific Discovery0
A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law0
IRIS: Interactive Research Ideation System for Accelerating Scientific DiscoveryCode1
AI Idea Bench 2025: AI Research Idea Generation Benchmark0
Ascribe New Dimensions to Scientific Data Visualization with VR0
Deep literature reviews: an application of fine-tuned language models to migration research0
Causal-Copilot: An Autonomous Causal Analysis Agent0
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis GenerationCode2
MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?0
Scaling Laws of Graph Neural Networks for Atomistic Materials Modeling0
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree SearchCode7
The Power of the Pareto Front: Balancing Uncertain Rewards for Adaptive Experimentation in scanning probe microscopy0
Foundation Models for Environmental Science: A Survey of Emerging Frontiers0
The AI Cosmologist I: An Agentic System for Automated Data AnalysisCode1
We Need Improved Data Curation and Attribution in AI for Scientific Discovery0
How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?Code0
Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning0
AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical KnowledgeCode2
Detecting Localized Density Anomalies in Multivariate Data via Coin-Flip StatisticsCode0
Towards Scientific Intelligence: A Survey of LLM-based Scientific Agents0
Interpretable Machine Learning in Physics: A Review0
A Retrieval-Augmented Knowledge Mining Method with Deep Thinking LLMs for Biomedical Research and Clinical Support0
Scaling Laws in Scientific Discovery with AI and Robot Scientists0
Confidence Adjusted Surprise Measure for Active Resourceful Trials (CA-SMART): A Data-driven Active Learning Framework for Accelerating Material Discovery under Resource Constraints0
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition0
Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees0
SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings0
Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations0
AgentRxiv: Towards Collaborative Autonomous ResearchCode9
Offline Model-Based Optimization: Comprehensive ReviewCode1
CodeScientist: End-to-End Semi-Automated Scientific Discovery with Code-based Experimentation0
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific ResearchCode1
Lessons from the trenches on evaluating machine-learning systems in materials science0
SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models0
Representation Retrieval Learning for Heterogeneous Data Integration0
Agentic AI for Scientific Discovery: A Survey of Progress, Challenges, and Future Directions0
Accelerating Earth Science Discovery via Multi-Agent LLM Systems0
Large Language Models for Zero-shot Inference of Causal Structures in Biology0
Building Machine Learning Challenges for Anomaly Detection in Science0
Enabling AI Scientists to Recognize Innovation: A Domain-Agnostic Algorithm for Assessing Novelty0
Can Large Language Models Help Experimental Design for Causal Discovery?0
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational BiologyCode2
CS-PaperSum: A Large-Scale Dataset of AI-Generated Summaries for Scientific Papers0
Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample CreationCode1
Towards an AI co-scientist0
A Perspective on Symbolic Machine Learning in Physical Sciences0
Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs0
Protein Large Language Models: A Comprehensive SurveyCode2
InductionBench: LLMs Fail in the Simplest Complexity ClassCode1
Show:102550
← PrevPage 2 of 10Next →

No leaderboard results yet.