SOTAVerified

Benchmarking

Papers

Showing 1151-1160 of 5548 papers

Title | Status | Hype
CODEBench: A Neural Architecture and Hardware Accelerator Co-Design Framework | Code | 1
CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation | Code | 1
A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial Images | Code | 1
DNN+NeuroSim V2.0: An End-to-End Benchmarking Framework for Compute-in-Memory Accelerators for On-chip Training | Code | 1
CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization | Code | 1
Does your model understand genes? A benchmark of gene properties for biological and text models | Code | 1
DomainLab: A modular Python package for domain generalization in deep learning | Code | 1
A Closer Look at Mortality Risk Prediction from Electrocardiograms | Code | 1
Benchmarking MRI Reconstruction Neural Networks on Large Public Datasets | Code | 1
COCO: The Large Scale Black-Box Optimization Benchmarking (bbob-largescale) Test Suite | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified