SOTAVerified

Benchmarking

Papers

Showing 1751-1760 of 5548 papers

Title | Status | Hype
Analyzing the Feature Extractor Networks for Face Image Synthesis | Code | 0
Inverse Contextual Bandits: Learning How Behavior Evolves over Time | Code | 0
Benchmarking Multi-dimensional AIGC Video Quality Assessment: A Dataset and Unified Model | Code | 0
Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning models | Code | 0
Integrating Expert Knowledge into Logical Programs via LLMs | Code | 0
Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement | Code | 0
COCO: A Platform for Comparing Continuous Optimizers in a Black-Box Setting | Code | 0
COCO: Performance Assessment | Code | 0
STEP: A Unified Spiking Transformer Evaluation Platform for Fair and Reproducible Benchmarking | Code | 0
Calibrated Adaptive Probabilistic ODE Solvers | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified