SOTAVerified

Benchmarking

Papers

Showing 1201-1225 of 5548 papers

Title | Status | Hype
When Do Flat Minima Optimizers Work? | Code | 1
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study | Code | 1
Are we really making much progress? Revisiting, benchmarking, and refining heterogeneous graph neural networks | Code | 1
Leveraging Trust for Joint Multi-Objective and Multi-Fidelity Optimization | Code | 1
Autonomous Reinforcement Learning: Formalism and Benchmarking | Code | 1
High-Dimensional Inference in Bayesian Networks | Code | 1
Boosting Neural Image Compression for Machines Using Latent Space Masking | Code | 1
Label, Verify, Correct: A Simple Few Shot Object Detection Method | Code | 1
Learning Representations with Contrastive Self-Supervised Learning for Histopathology Applications | Code | 1
Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets | Code | 1
Object Shape Error Response Using Bayesian 3-D Convolutional Neural Networks for Assembly Systems With Compliant Parts | Code | 1
HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder | Code | 1
Neuro-Symbolic Inductive Logic Programming with Logical Neural Networks | Code | 1
BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale | Code | 1
CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer | Code | 1
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation | Code | 1
MC-Blur: A Comprehensive Benchmark for Image Deblurring | Code | 1
Neural Regression, Representational Similarity, Model Zoology & Neural Taskonomy at Scale in Rodent Visual Cortex | Code | 1
NEORL: NeuroEvolution Optimization with Reinforcement Learning | Code | 1
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models | Code | 1
Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces | Code | 1
EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search | Code | 1
FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Code | 1
Benchmarking Detection Transfer Learning with Vision Transformers | Code | 1
Benchmarking emergency department triage prediction models with machine learning and large public electronic health records | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified