SOTAVerified

Benchmarking

Papers

Showing 34413450 of 5548 papers

TitleStatusHype
Revisiting the Gumbel-Softmax in MADDPGCode1
A framework for benchmarking class-out-of-distribution detection and its application to ImageNetCode1
Dermatological Diagnosis Explainability Benchmark for Convolutional Neural NetworksCode0
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks0
An Efficient Two-stage Gradient Boosting Framework for Short-term Traffic State EstimationCode0
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry0
Arena-Rosnav 2.0: A Development and Benchmarking Platform for Robot Navigation in Highly Dynamic EnvironmentsCode0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK0
Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking0
Show:102550
← PrevPage 345 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified