SOTAVerified

Benchmarking

Papers

Showing 27412750 of 5548 papers

TitleStatusHype
Benchmarking Domain Generalization Algorithms in Computational PathologyCode0
Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices0
Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning0
Omnibenchmark (alpha) for continuous and open benchmarking in bioinformatics0
SEN12-WATER: A New Dataset for Hydrological Applications and its Benchmarking0
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting FrameworkCode0
HLB: Benchmarking LLMs' Humanlikeness in Language Use0
Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted DataCode0
Qualitative Insights Tool (QualIT): LLM Enhanced Topic Modeling0
Ducho meets Elliot: Large-scale Benchmarks for Multimodal RecommendationCode0
Show:102550
← PrevPage 275 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified