SOTAVerified

Benchmarking

Papers

Showing 2891-2900 of 5548 papers

Title | Status | Hype
The EuroCity Persons Dataset: A Novel Benchmark for Object Detection | | 0
The Evolutionary Computation Methods No One Should Use | | 0
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | | 0
Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments | | 0
Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation | | 0
Benchmarking Video Frame Interpolation | | 0
SnCQA: A hardware-efficient equivariant quantum convolutional circuit architecture | | 0
HLB: Benchmarking LLMs' Humanlikeness in Language Use | | 0
Benchmarking Unsupervised Outlier Detection with Realistic Synthetic Data | | 0
The Expressive Power of Word Embeddings | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified