SOTAVerified

Benchmarking

Papers

Showing 511520 of 5548 papers

TitleStatusHype
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and OutlookCode2
MINERVA: Evaluating Complex Video ReasoningCode2
GEOM-Drugs Revisited: Toward More Chemically Accurate Benchmarks for 3D Molecule GenerationCode1
Towards Robust and Generalizable Gerchberg Saxton based Physics Inspired Neural Networks for Computer Generated Holography: A Sensitivity Analysis Framework0
From Precision to Perception: User-Centred Evaluation of Keyword Extraction Algorithms for Internet-Scale Contextual Advertising0
Sadeed: Advancing Arabic Diacritization Through Small Language Model0
Galvatron: An Automatic Distributed System for Efficient Foundation Model Training0
Evaluating Generative Models for Tabular Data: Novel Metrics and Benchmarking0
OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System VerificationCode1
The Leaderboard Illusion0
Show:102550
← PrevPage 52 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified