SOTAVerified

Benchmarking

Papers

Showing 33713380 of 5548 papers

TitleStatusHype
Lightweight Jet Reconstruction and Identification as an Object Detection Task0
Solving excited states for long-range interacting trapped ions with neural networks0
Top Score on the Wrong Exam: On Benchmarking in Machine Learning for Vulnerability Detection0
Benchmarking Multi-National Value Alignment for Large Language Models0
LIM: Large Interpolator Model for Dynamic Reconstruction0
Advanced Manufacturing Configuration by Sample-efficient Batch Bayesian Optimization0
Line Goes Up? Inherent Limitations of Benchmarks for Evaluating Large Language Models0
Liquid State Genetic Programming0
Livestock Monitoring with Transformer0
Benchmarking Multimodal Sentiment Analysis0
Show:102550
← PrevPage 338 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified