SOTAVerified

Benchmarking

Papers

Showing 39514000 of 5548 papers

TitleStatusHype
Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields0
Learning to Adapt to Online Streams with Distribution Shifts0
Benchmarking Self-Supervised Contrastive Learning Methods for Image-Based Plant PhenotypingCode0
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking0
Benchmarking Deepart Detection0
Predicting the Performance of a Computing System with Deep Networks0
Benchmarking of Cancelable Biometrics for Deep Templates0
STA: Self-controlled Text Augmentation for Improving Text ClassificationsCode0
Dermatological Diagnosis Explainability Benchmark for Convolutional Neural NetworksCode0
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views0
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks0
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines0
An Efficient Two-stage Gradient Boosting Framework for Short-term Traffic State EstimationCode0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry0
Arena-Rosnav 2.0: A Development and Benchmarking Platform for Robot Navigation in Highly Dynamic EnvironmentsCode0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK0
Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking0
Benchmarking Continuous Time Models for Predicting Multiple Sclerosis Progression0
Efficiency in European Air Traffic Management -- A Fundamental Analysis of Data, Models, and Methods0
Model-Based Underwater 6D Pose Estimation from RGB0
A Neuromorphic Dataset for Object Segmentation in Indoor Cluttered EnvironmentCode0
Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking0
AI Sound Recognition on Asthma Medication Adherence: Evaluation With the RDA Benchmark SuiteCode0
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models0
Participatory Personalization in Classification0
Arena-Web -- A Web-based Development and Benchmarking Platform for Autonomous Navigation Approaches0
NA-SODINN: a deep learning algorithm for exoplanet image detection based on residual noise regimes0
Stability Constrained OPF in Microgrids: A Chance Constrained Optimization Framework with Non-Gaussian Uncertainty0
Benchmarking sparse system identification with low-dimensional chaos0
Characterization of Constrained Continuous Multiobjective Optimization Problems: A Performance Space Perspective0
An Operational Perspective to Fairness Interventions: Where and How to Intervene0
Benchmarking Probabilistic Deep Learning Methods for License Plate RecognitionCode0
Data-driven Approach for Static Hedging of Exchange Traded Options0
Continuous U-Net: Faster, Greater and Noiseless0
Enhancing Hyper-To-Real Space Projections Through Euclidean Norm Meta-Heuristic OptimizationCode0
Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST)0
Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022Code0
Population-wise Labeling of Sulcal Graphs using Multi-graph MatchingCode0
Benchmarking optimality of time series classification methods in distinguishing diffusionsCode0
Cross-Subject Deep Transfer Models for Evoked Potentials in Brain-Computer Interface0
Quality Indicators for Preference-based Evolutionary Multi-objective Optimization Using a Reference Point: A Review and AnalysisCode0
Heterogeneous Datasets for Federated Survival Analysis SimulationCode0
Task-Agnostic Graph Neural Network Evaluation via Adversarial CollaborationCode0
A Systematic Review of Green AICode0
Out of Distribution Performance of State of Art Vision Model0
Towards Robust Metrics for Concept Representation EvaluationCode0
SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain0
Job recommendations: benchmarking of collaborative filtering methods for classifieds0
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applicationsCode0
Vision Learners Meet Web Image-Text Pairs0
Show:102550
← PrevPage 80 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified