SOTAVerified

Benchmarking

Papers

Showing 38913900 of 5548 papers

TitleStatusHype
ScanNeRF: a Scalable Benchmark for Neural Radiance Fields0
SCBench: A Sports Commentary Benchmark for Video LLMs0
Scenarios and Approaches for Situated Natural Language Explanations0
ScholarSearch: Benchmarking Scholar Searching Ability of LLMs0
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement0
Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers0
Scientific Machine Learning Benchmarks0
SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models0
scMamba: A Scalable Foundation Model for Single-Cell Multi-Omics Integration Beyond Highly Variable Feature Selection0
Score-Based Generative Models for Molecule Generation0
Show:102550
← PrevPage 390 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified