SOTAVerified

Benchmarking

Papers

Showing 3811–3820 of 5548 papers

Title | Status | Hype
Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals | Code | 0
Improved statistical benchmarking of digital pathology models using pairwise frames evaluation | | 0
Benchmarking Robustness of AI-Enabled Multi-sensor Fusion Systems: Challenges and Opportunities | | 0
Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models | | 0
Explainable AI using expressive Boolean formulas | | 0
Financial Numeric Extreme Labelling: A Dataset and Benchmarking for XBRL Tagging | | 0
Benchmarking Middle-Trained Language Models for Neural Search | | 0
N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition | | 0
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning | | 0
EfficientSRFace: An Efficient Network with Super-Resolution Enhancement for Accurate Face Detection | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified