SOTAVerified

Benchmarking

Papers

Showing 34263450 of 5548 papers

TitleStatusHype
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid ManipulationCode2
Benchmarking framework for machine learning classification from fNIRS dataCode0
Benchmarking White Blood Cell Classification Under Domain ShiftCode0
Data-Efficient Training of CNNs and Transformers with Coresets: A Stability PerspectiveCode0
POPGym: Benchmarking Partially Observable Reinforcement LearningCode2
Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields0
Learning to Adapt to Online Streams with Distribution Shifts0
Benchmarking Self-Supervised Contrastive Learning Methods for Image-Based Plant PhenotypingCode0
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking0
Benchmarking Deepart Detection0
Predicting the Performance of a Computing System with Deep Networks0
Benchmarking of Cancelable Biometrics for Deep Templates0
STA: Self-controlled Text Augmentation for Improving Text ClassificationsCode0
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views0
What Can We Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet ClassifiersCode1
Revisiting the Gumbel-Softmax in MADDPGCode1
A framework for benchmarking class-out-of-distribution detection and its application to ImageNetCode1
Dermatological Diagnosis Explainability Benchmark for Convolutional Neural NetworksCode0
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks0
An Efficient Two-stage Gradient Boosting Framework for Short-term Traffic State EstimationCode0
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry0
Arena-Rosnav 2.0: A Development and Benchmarking Platform for Robot Navigation in Highly Dynamic EnvironmentsCode0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK0
Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking0
Show:102550
← PrevPage 138 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified