SOTAVerified

Benchmarking

Papers

Showing 10311040 of 5548 papers

TitleStatusHype
MGTBench: Benchmarking Machine-Generated Text DetectionCode1
MEGA: Multilingual Evaluation of Generative AICode1
Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering Regularized Self-TrainingCode1
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4Code1
CCTV-Gun: Benchmarking Handgun Detection in CCTV ImagesCode1
COVID-19 event extraction from Twitter via extractive question answering with continuous promptsCode1
TransNetR: Transformer-based Residual Network for Polyp Segmentation with Multi-Center Out-of-Distribution TestingCode1
What Can We Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet ClassifiersCode1
Revisiting the Gumbel-Softmax in MADDPGCode1
A framework for benchmarking class-out-of-distribution detection and its application to ImageNetCode1
Show:102550
← PrevPage 104 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified