SOTAVerified

Benchmarking

Papers

Showing 2311–2320 of 5548 papers

Title | Status | Hype
Graph Convolutional Networks Meet with High Dimensionality Reduction | Code | 0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Code | 0
GPT4Graph: Can Large Language Models Understand Graph Structured Data ? An Empirical Evaluation and Benchmarking | Code | 0
Echo State Networks with Self-Normalizing Activations on the Hyper-Sphere | Code | 0
Aggregated Attributions for Explanatory Analysis of 3D Segmentation Models | Code | 0
ECBD: Evidence-Centered Benchmark Design for NLP | Code | 0
Benchmarking LLMs' Judgments with No Gold Standard | Code | 0
Agentic-HLS: An agentic reasoning based high-level synthesis system using large language models (AI for EDA workshop 2024) | Code | 0
GNNMerge: Merging of GNN Models Without Accessing Training Data | Code | 0
A Collection of Quality Diversity Optimization Problems Derived from Hyperparameter Optimization of Machine Learning Models | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified