SOTAVerified

Benchmarking

Papers

Showing 12711280 of 5548 papers

TitleStatusHype
How to Benchmark Vision Foundation Models for Semantic Segmentation?Code1
A framework for benchmarking clustering algorithmsCode1
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken ConversationsCode1
MELTing point: Mobile Evaluation of Language TransformersCode1
How to Train Neural Field Representations: A Comprehensive Study and BenchmarkCode1
MetaBox: A Benchmark Platform for Meta-Black-Box Optimization with Reinforcement LearningCode1
Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge GraphCode1
MetaFormer and CNN Hybrid Model for Polyp Image SegmentationCode1
Benchmarking structure-based three-dimensional molecular generative models using GenBench3D: ligand conformation quality mattersCode1
Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantificationCode1
Show:102550
← PrevPage 128 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified