SOTAVerified

Benchmarking

Papers

Showing 4151-4160 of 5548 papers

Title | Status | Hype
The Unconstrained Ear Recognition Challenge | | 0
The Unconstrained Ear Recognition Challenge 2019 - ArXiv Version With Appendix | | 0
THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models | | 0
TIIF-Bench: How Does Your T2I Model Follow Your Instructions? | | 0
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection | | 0
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time | | 0
Time Sensitive Knowledge Editing through Efficient Finetuning | | 0
TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs | | 0
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines | | 0
Timing Excess Returns A cross-universe approach to alpha | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified