SOTAVerified

Benchmarking

Papers

Showing 37613770 of 5548 papers

TitleStatusHype
Towards Stable 3D Object Detection0
Benchmarking Domain Generalization on EEG-based Emotion Recognition0
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks0
MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts0
AT-Drone: Benchmarking Adaptive Teaming in Multi-Drone Pursuit0
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing0
Benchmarking Diverse-Modal Entity Linking with Generative Models0
Benchmarking Discrete Optimization Heuristics with IOHprofiler0
Non-linear Multitask Learning with Deep Gaussian Processes0
Benchmarking Differential Evolution on a Quantum Simulator0
Show:102550
← PrevPage 377 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified