SOTAVerified

Benchmarking

Papers

Showing 3371-3380 of 5548 papers

Title | Status | Hype
Vi(E)va LLM! A Conceptual Stack for Evaluating and Interpreting Generative AI-based Visualizations | Code | 0
Probing Critical Learning Dynamics of PLMs for Hate Speech Detection | Code | 0
Can LLMs perform structured graph reasoning? | Code | 0
Variational Quantum Circuits Enhanced Generative Adversarial Network | | 0
Benchmarking Spiking Neural Network Learning Methods with Varying Locality | | 0
Coherent Feed Forward Quantum Neural Network | | 0
MRAnnotator: multi-Anatomy and many-Sequence MRI segmentation of 44 structures | | 0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Code | 0
Benchmarking Sensitivity of Continual Graph Learning for Skeleton-Based Action Recognition | | 0
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified