SOTAVerified

Benchmarking

Papers

Showing 34513500 of 5548 papers

TitleStatusHype
Benchmarking Continuous Time Models for Predicting Multiple Sclerosis Progression0
Efficiency in European Air Traffic Management -- A Fundamental Analysis of Data, Models, and Methods0
Model-Based Underwater 6D Pose Estimation from RGB0
A Neuromorphic Dataset for Object Segmentation in Indoor Cluttered EnvironmentCode0
Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking0
A SWAT-based Reinforcement Learning Framework for Crop ManagementCode1
AI Sound Recognition on Asthma Medication Adherence: Evaluation With the RDA Benchmark SuiteCode0
Fortuna: A Library for Uncertainty Quantification in Deep LearningCode2
Participatory Personalization in Classification0
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models0
NA-SODINN: a deep learning algorithm for exoplanet image detection based on residual noise regimes0
Arena-Web -- A Web-based Development and Benchmarking Platform for Autonomous Navigation Approaches0
SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic SurgeryCode1
Benchmarking sparse system identification with low-dimensional chaos0
Stability Constrained OPF in Microgrids: A Chance Constrained Optimization Framework with Non-Gaussian Uncertainty0
Characterization of Constrained Continuous Multiobjective Optimization Problems: A Performance Space Perspective0
CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasksCode1
An Operational Perspective to Fairness Interventions: Where and How to Intervene0
Benchmarking Algorithms for Submodular Optimization Problems Using IOHProfilerCode1
Benchmarking Probabilistic Deep Learning Methods for License Plate RecognitionCode0
Rethinking low-cost microscopy workflow: Image enhancement using deep based Extended Depth of Field methodsCode1
Data-driven Approach for Static Hedging of Exchange Traded Options0
Continuous U-Net: Faster, Greater and Noiseless0
Enhancing Hyper-To-Real Space Projections Through Euclidean Norm Meta-Heuristic OptimizationCode0
Population-wise Labeling of Sulcal Graphs using Multi-graph MatchingCode0
Benchmarking Large Language Models for News SummarizationCode1
Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST)0
Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022Code0
Benchmarking Robustness to Adversarial Image ObfuscationsCode1
Benchmarking optimality of time series classification methods in distinguishing diffusionsCode0
Cross-Subject Deep Transfer Models for Evoked Potentials in Brain-Computer Interface0
Heterogeneous Datasets for Federated Survival Analysis SimulationCode0
Quality Indicators for Preference-based Evolutionary Multi-objective Optimization Using a Reference Point: A Review and AnalysisCode0
TemporAI: Facilitating Machine Learning Innovation in Time Domain Tasks for MedicineCode1
Task-Agnostic Graph Neural Network Evaluation via Adversarial CollaborationCode0
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement LearningCode3
A Systematic Review of Green AICode0
BiBench: Benchmarking and Analyzing Network BinarizationCode1
Out of Distribution Performance of State of Art Vision Model0
Towards Robust Metrics for Concept Representation EvaluationCode0
SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain0
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applicationsCode0
Job recommendations: benchmarking of collaborative filtering methods for classifieds0
Vision Learners Meet Web Image-Text Pairs0
Hawk: An Industrial-strength Multi-label Document Classifier0
Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)Code2
Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces RecognitionCode1
Evaluating the Transferability of Machine-Learned Force Fields for Material Property ModelingCode0
Critical review of conformational B-cell epitope prediction methodsCode0
Benchmarking Robustness in Neural Radiance Fields0
Show:102550
← PrevPage 70 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified