SOTAVerified

Benchmarking

Papers

Showing 861870 of 5548 papers

TitleStatusHype
Introducing Milabench: Benchmarking Accelerators for AICode1
Introducing the VoicePrivacy InitiativeCode1
GEOM-Drugs Revisited: Toward More Chemically Accurate Benchmarks for 3D Molecule GenerationCode1
CheX-GPT: Harnessing Large Language Models for Enhanced Chest X-ray Report LabelingCode1
An Evaluation Dataset for Intent Classification and Out-of-Scope PredictionCode1
Benchmarking Batch Deep Reinforcement Learning AlgorithmsCode1
Benchmarking Multimodal Knowledge Conflict for Large Multimodal ModelsCode1
EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture SearchCode1
Emoji Prediction: Extensions and BenchmarkingCode1
Benchmarking Low-Shot Robustness to Natural Distribution ShiftsCode1
Show:102550
← PrevPage 87 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified