SOTAVerified

Benchmarking

Papers

Showing 14011425 of 5548 papers

TitleStatusHype
PT-Ranking: A Benchmarking Platform for Neural Learning-to-RankCode1
NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and SizeCode1
Image Colorization: A Survey and DatasetCode1
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw TheoryCode1
Quantitative Survey of the State of the Art in Sign Language RecognitionCode1
Automatic sleep stage classification with deep residual networks in a mixed-cohort settingCode1
ISSAFE: Improving Semantic Segmentation in Accidents by Fusing Event-based DataCode1
AIPerf: Automated machine learning as an AI-HPC benchmarkCode1
dMelodies: A Music Dataset for Disentanglement LearningCode1
WordCraft: An Environment for Benchmarking Commonsense AgentsCode1
Are We There Yet? Evaluating State-of-the-Art Neural Network based Geoparsers Using EUPEG as a Benchmarking PlatformCode1
Emoji Prediction: Extensions and BenchmarkingCode1
CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning RobustnessCode1
Enhancing spatial and textual analysis with EUPEG: an extensible and unified platform for evaluating geoparsersCode1
GAMA: a General Automated Machine learning AssistantCode1
IOHanalyzer: Detailed Performance Analyses for Iterative Optimization HeuristicsCode1
RobFR: Benchmarking Adversarial Robustness on Face RecognitionCode1
URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural NetworksCode1
Re-thinking Co-Salient Object DetectionCode1
Wiki-CS: A Wikipedia-Based Benchmark for Graph Neural NetworksCode1
Quo Vadis, Skeleton Action Recognition ?Code1
Descending through a Crowded Valley - Benchmarking Deep Learning OptimizersCode1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via MetagradientCode1
EndoSLAM Dataset and An Unsupervised Monocular Visual Odometry and Depth Estimation Approach for Endoscopic Videos: Endo-SfMLearnerCode1
Labelling unlabelled videos from scratch with multi-modal self-supervisionCode1
Show:102550
← PrevPage 57 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified