SOTAVerified

Benchmarking

Papers

Showing 33613370 of 5548 papers

TitleStatusHype
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit0
RoboPianist: Dexterous Piano Playing with Deep Reinforcement LearningCode2
ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis0
Benchmarking the Robustness of Quantized Models0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented DataCode0
Probing Conceptual Understanding of Large Visual-Language ModelsCode0
Interpretable statistical representations of neural population dynamics and geometryCode1
Benchmarking Robustness to Text-Guided CorruptionsCode0
DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images0
MMVC: Learned Multi-Mode Video Compression with Block-based Prediction Mode Selection and Density-Adaptive Entropy CodingCode1
Show:102550
← PrevPage 337 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified