SOTAVerified

Benchmarking

Papers

Showing 23512360 of 5548 papers

TitleStatusHype
Benchmarking Counterfactual Image GenerationCode1
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal CorruptionsCode1
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian ContextCode0
Are Large Language Models Good at Utility Judgments?Code0
Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAMCode1
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic ObjectCode1
Towards Image Ambient Lighting NormalizationCode1
Benchmarking Object Detectors with COCO: A New Path ForwardCode1
RankMamba: Benchmarking Mamba's Document Ranking Performance in the Era of TransformersCode1
Benchmarking Image Transformers for Prostate Cancer Detection from Ultrasound Data0
Show:102550
← PrevPage 236 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified