SOTAVerified

Benchmarking

Papers

Showing 1726–1750 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Disability prediction in multiple sclerosis using performance outcome measures and demographic data | | 0 |
| Discriminative Link Prediction using Local Links, Node Features and Community Structure | | 0 |
| CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering | | 0 |
| Benchmarking a wide range of optimisers for solving the Fermi-Hubbard model using the variational quantum eigensolver | | 0 |
| Classification and Retrieval of Digital Pathology Scans: A New Dataset | | 0 |
| A biologically-inspired multi-modal evaluation of molecular generative machine learning | | 0 |
| Classifying neuromorphic data using a deep learning framework for image classification | | 0 |
| DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs | | 0 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | | 0 |
| DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale | | 0 |
| Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset | | 0 |
| CityLearn v2: Energy-flexible, resilient, occupant-centric, and carbon-aware management of grid-interactive communities | | 0 |
| Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks | | 0 |
| Addressing the Real-world Class Imbalance Problem in Dermatology | | 0 |
| CISOL: An Open and Extensible Dataset for Table Structure Recognition in the Construction Industry | | 0 |
| Benchmarking Automated Review Response Generation for the Hospitality Domain | | 0 |
| Benchmarking bias: Expanding clinical AI model card to incorporate bias reporting of social and non-social factors | | 0 |
| Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy | | 0 |
| CLIRudit: Cross-Lingual Information Retrieval of Scientific Documents | | 0 |
| DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior | | 0 |
| CLLMate: A Multimodal Benchmark for Weather and Climate Events Forecasting | | 0 |
| Benchmarking Automated Machine Learning Methods for Price Forecasting Applications | | 0 |
| CIMLA: Interpretable AI for inference of differential causal networks | | 0 |
| CloudifierNet -- Deep Vision Models for Artificial Image Processing | | 0 |
| CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis | | 0 |
Page 70 of 222

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |