SOTAVerified

Benchmarking

Papers

Showing 39213930 of 5548 papers

TitleStatusHype
On Continual Model Refinement in Out-of-Distribution Data Streams0
Active Learning for Community Detection in Stochastic Block Models0
On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events0
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos0
On Distribution Grid Optimal Power Flow Development and Integration0
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities0
One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision0
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese0
One of these (Few) Things is Not Like the Others0
Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios0
Show:102550
← PrevPage 393 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified