SOTAVerified

Benchmarking

Papers

Showing 23212330 of 5548 papers

TitleStatusHype
A Collection of Quality Diversity Optimization Problems Derived from Hyperparameter Optimization of Machine Learning ModelsCode0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation dataCode0
An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum DisorderCode0
A Review of Testing Object-Based Environment Perception for Safe Automated DrivingCode0
Benchmarking Machine Translation with Cultural AwarenessCode0
EmProx: Neural Network Performance Estimation For Neural Architecture SearchCode0
GPT4Graph: Can Large Language Models Understand Graph Structured Data ? An Empirical Evaluation and BenchmarkingCode0
Dyport: Dynamic Importance-based Hypothesis Generation Benchmarking TechniqueCode0
DynCIM: Dynamic Curriculum for Imbalanced Multimodal LearningCode0
GOAL: Towards Benchmarking Few-Shot Sports Game SummarizationCode0
Show:102550
← PrevPage 233 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified