SOTAVerified

Benchmarking

Papers

Showing 29112920 of 5548 papers

TitleStatusHype
Quality Assured: Rethinking Annotation Strategies in Imaging AI0
Building a Domain-specific Guardrail Model in Production0
Flexible Generation of Preference Data for Recommendation AnalysisCode0
Can time series forecasting be automated? A benchmark and analysis0
Aggregated Attributions for Explanatory Analysis of 3D Segmentation ModelsCode0
Hi-EF: Benchmarking Emotion Forecasting in Human-interactionCode0
BONES: a Benchmark fOr Neural Estimation of Shapley valuesCode0
StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation0
Customized Retrieval Augmented Generation and Benchmarking for EDA Tool Documentation QACode0
Benchmarks as Microscopes: A Call for Model Metrology0
Show:102550
← PrevPage 292 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified