SOTAVerified

Benchmarking

Papers

Showing 1191-1200 of 5548 papers

Title | Status | Hype
AI Accelerator Survey and Trends | Code | 1
ISLES 2022: A multi-center magnetic resonance imaging stroke lesion segmentation dataset | Code | 1
Benchmarking Neural Network Generalization for Grammar Induction | Code | 1
Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations | Code | 1
Benchmarking Segmentation Models with Mask-Preserved Attribute Editing | Code | 1
Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and Beyond | Code | 1
Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT | Code | 1
GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking | Code | 1
GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking | Code | 1
GraCoRe: Benchmarking Graph Comprehension and Complex Reasoning in Large Language Models | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified