SOTAVerified

Benchmarking

Papers

Showing 911920 of 5548 papers

TitleStatusHype
Benchmarking MRI Reconstruction Neural Networks on Large Public DatasetsCode1
Benchmarking Data-driven Surrogate Simulators for Artificial Electromagnetic MaterialsCode1
LLM-Pilot: Characterize and Optimize Performance of your LLM Inference ServicesCode1
Enhancing Biomedical Relation Extraction with DirectionalityCode1
End-to-end Emotion-Cause Pair Extraction via Learning to LinkCode1
ConsumerBench: Benchmarking Generative AI Applications on End-User DevicesCode1
Guardians of Image Quality: Benchmarking Defenses Against Adversarial Attacks on Image Quality MetricsCode1
LOB-Bench: Benchmarking Generative AI for Finance -- an Application to Limit Order Book DataCode1
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning AlgorithmsCode1
End-to-end Knowledge Retrieval with Multi-modal QueriesCode1
Show:102550
← PrevPage 92 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified