SOTAVerified

Benchmarking

Papers

Showing 27612770 of 5548 papers

TitleStatusHype
AI Idea Bench 2025: AI Research Idea Generation Benchmark0
Exploring the Adversarial Frontier: Quantifying Robustness via Adversarial Hypervolume0
ImageNet performance correlates with pose estimation robustness and generalization on out-of-domain data0
Improved YOLOv12 with LLM-Generated Synthetic Data for Enhanced Apple Detection and Benchmarking Against YOLOv11 and YOLOv100
A Survey of Model Compression and Acceleration for Deep Neural Networks0
Geometric feature performance under downsampling for EEG classification tasks0
Benchmarking Poisoning Attacks against Retrieval-Augmented Generation0
Geometry Matters: Benchmarking Scientific ML Approaches for Flow Prediction around Complex Geometries0
Image2Struct: Benchmarking Structure Extraction for Vision-Language Models0
Exploring Continual Learning of Diffusion Models0
Show:102550
← PrevPage 277 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified