SOTAVerified

Benchmarking

Papers

Showing 791800 of 5548 papers

TitleStatusHype
Benchmarking Large Multimodal Models against Common CorruptionsCode1
Benchmarking Adversarial Patch Against Aerial DetectionCode1
dMelodies: A Music Dataset for Disentanglement LearningCode1
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
Beyond Correctness: Benchmarking Multi-dimensional Code Generation for Large Language ModelsCode1
Benchmarking Adversarial Robustness on Image ClassificationCode1
Benchmarking of DL Libraries and Models on Mobile DevicesCode1
GLGENN: A Novel Parameter-Light Equivariant Neural Networks Architecture Based on Clifford Geometric AlgebrasCode1
DNN+NeuroSim V2.0: An End-to-End Benchmarking Framework for Compute-in-Memory Accelerators for On-chip TrainingCode1
Does your model understand genes? A benchmark of gene properties for biological and text modelsCode1
Show:102550
← PrevPage 80 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified