SOTAVerified

Benchmarking

Papers

Showing 40264050 of 5548 papers

TitleStatusHype
Benchmarking Algorithmic Bias in Face Recognition: An Experimental Approach Using Synthetic Faces and Human Evaluation0
Opposition based Ensemble Micro Differential Evolution0
Trial-Based Dominance Enables Non-Parametric Tests to Compare both the Speed and Accuracy of Stochastic Optimizers0
Optimal Eco-driving Control of Autonomous and Electric Trucks in Adaptation to Highway Topography: Energy Minimization and Battery Life Extension0
Optimally-Weighted Maximum Mean Discrepancy Framework for Continual Learning0
Optimal PMU Placement for Kalman Filtering of DAE Power System Models0
Optimal Scheduling of Anticipated COVID-19 Vaccination: A Case Study of New York State0
Optimization of Genomic Classifiers for Clinical Deployment: Evaluation of Bayesian Optimization to Select Predictive Models of Acute Infection and In-Hospital Mortality0
Optimization Techniques for a Physical Model of Human Vocalisation0
Optimizing open-domain question answering with graph-based retrieval augmented generation0
Benchmarking air-conditioning energy performance of residential rooms based on regression and clustering techniques0
Optimizing Recommendations using Fine-Tuned LLMs0
OPTION: OPTImization Algorithm Benchmarking ONtology0
OPTION: OPTImization Algorithm Benchmarking ONtology0
Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol0
Benchmarking Agility and Reconfigurability in Satellite Systems for Tropical Cyclone Monitoring0
Trident: Efficient 4PC Framework for Privacy Preserving Machine Learning0
When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks0
TriSAM: Tri-Plane SAM for zero-shot cortical blood vessel segmentation in VEM images0
OReole-FM: successes and challenges toward billion-parameter foundation models for high-resolution satellite imagery0
Organ-aware Multi-scale Medical Image Segmentation Using Text Prompt Engineering0
Benchmarking Aggression Identification in Social Media0
Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition0
A critical look at the current train/test split in machine learning0
Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report0
Show:102550
← PrevPage 162 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified