SOTAVerified

Benchmarking

Papers

Showing 18911900 of 5548 papers

TitleStatusHype
HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability predictionCode0
CodeUpdateArena: Benchmarking Knowledge Editing on API UpdatesCode1
Simulation-based Benchmarking for Causal Structure Learning in Gene Perturbation ExperimentsCode0
OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental LearningCode1
GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation0
TARGO: Benchmarking Target-driven Object Grasping under Occlusions0
A Benchmark for Multi-speaker Anonymization0
MERGE -- A Bimodal Audio-Lyrics Dataset for Static Music Emotion Recognition0
Replication in Visual Diffusion Models: A Survey and OutlookCode1
Rethinking the Effectiveness of Graph Classification Datasets in Benchmarks for Assessing GNNsCode0
Show:102550
← PrevPage 190 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified