SOTAVerified

Benchmarking

Papers

Showing 4101-4125 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation |  | 0 |
| High-Dimensional Inference in Bayesian Networks | Code | 1 |
| Logically at Factify 2022: Multimodal Fact Verification |  | 0 |
| A Modular Workflow for Performance Benchmarking of Neuronal Network Simulations | Code | 0 |
| On the Use of Quality Diversity Algorithms for The Traveling Thief Problem |  | 0 |
| Boosting Neural Image Compression for Machines Using Latent Space Masking | Code | 1 |
| On the Value of ML Models |  | 0 |
| GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal Segmentation | Code | 0 |
| Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets | Code | 1 |
| Learning Representations with Contrastive Self-Supervised Learning for Histopathology Applications | Code | 1 |
| Label, Verify, Correct: A Simple Few Shot Object Detection Method | Code | 1 |
| 7th AI Driving Olympics: 1st Place Report for Panoptic Tracking |  | 0 |
| GreenPCO: An Unsupervised Lightweight Point Cloud Odometry Method |  | 0 |
| Object Shape Error Response Using Bayesian 3-D Convolutional Neural Networks for Assembly Systems With Compliant Parts | Code | 1 |
| HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder | Code | 1 |
| Neuro-Symbolic Inductive Logic Programming with Logical Neural Networks | Code | 1 |
| BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale | Code | 1 |
| Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research |  | 0 |
| TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation | Code | 1 |
| CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer | Code | 1 |
| NEORL: NeuroEvolution Optimization with Reinforcement Learning | Code | 1 |
| Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines |  | 0 |
| MC-Blur: A Comprehensive Benchmark for Image Deblurring | Code | 1 |
| Neural Regression, Representational Similarity, Model Zoology & Neural Taskonomy at Scale in Rodent Visual Cortex | Code | 1 |
| TinyML Platforms Benchmarking |  | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 |  | Unverified |