SOTAVerified

Benchmarking

Papers

Showing 3221–3230 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| TIIF-Bench: How Does Your T2I Model Follow Your Instructions? | | 0 |
| Knowledge Sharing in Manufacturing using Large Language Models: User Evaluation and Model Benchmarking | | 0 |
| 3D Compositional Zero-shot Learning with DeCompositional Consensus | | 0 |
| Benchmarking Performance of Deep Learning Model for Material Segmentation on Two HPC Systems | | 0 |
| Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges | | 0 |
| Benchmarking Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD) | | 0 |
| Benchmarking PathCLIP for Pathology Image Analysis | | 0 |
| Kolmogorov-Arnold Network for Transistor Compact Modeling | | 0 |
| Koopman Theory-Inspired Method for Learning Time Advancement Operators in Unstable Flame Front Evolution | | 0 |
| Benchmarking Out-of-Distribution Generalization Capabilities of DNN-based Encoding Models for the Ventral Visual Cortex | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |