SOTAVerified

Benchmarking

Papers

Showing 20512060 of 5548 papers

TitleStatusHype
Assessing Foundation Models for Sea Ice Type Segmentation in Sentinel-1 SAR Imagery0
ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition0
Benchmarking Deep Learning-Based Methods for Irradiance Nowcasting with Sky Images0
CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?Code0
GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics0
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance0
CSPO: Cross-Market Synergistic Stock Price Movement Forecasting with Pseudo-volatility Optimization0
Benchmarking and optimizing organism wide single-cell RNA alignment methodsCode0
Can geometric combinatorics improve RNA branching predictions?Code0
RxRx3-core: Benchmarking drug-target interactions in High-Content Microscopy0
Show:102550
← PrevPage 206 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified