SOTAVerified

Benchmarking

Papers

Showing 16761700 of 5548 papers

TitleStatusHype
Benchmarking Collaborative Learning Methods Cost-Effectiveness for Prostate Segmentation0
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture0
A Distance Oriented Kalman Filter Particle Swarm Optimizer Applied to Multi-Modality Image Registration0
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems0
Detecting Out-Of-Distribution Samples Using Low-Order Deep Features Statistics0
Device Modeling Bias in ReRAM-based Neural Network Simulations0
Different Horses for Different Courses: Comparing Bias Mitigation Algorithms in ML0
Diverse Community Data for Benchmarking Data Privacy Algorithms0
Benchmarking CNN on 3D Anatomical Brain MRI: Architectures, Data Augmentation and Deep Ensemble Learning0
Benchmarking Clinical Decision Support Search0
Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models0
Design2Code: Benchmarking Multimodal Code Generation for Automated Front-End Engineering0
Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition0
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis0
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies0
A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection0
ABOUT ML: Annotation and Benchmarking on Understanding and Transparency of Machine Learning Lifecycles0
Design and benchmarking of a two degree of freedom tendon driver unit for cable-driven wearable technologies0
CKnowEdit: A New Chinese Knowledge Editing Dataset for Linguistics, Facts, and Logic Error Correction in LLMs0
A New Stereo Benchmarking Dataset for Satellite Images0
A New Real-World Video Dataset for the Comparison of Defogging Algorithms0
Benchmarking Chest X-ray Diagnosis Models Across Multinational Datasets0
A Density-Guided Temporal Attention Transformer for Indiscernible Object Counting in Underwater Video0
A Boosting Approach to Constructing an Ensemble Stack0
An Analysis of an Integrated Mathematical Modeling -- Artificial Neural Network Approach for the Problems with a Limited Learning Dataset0
Show:102550
← PrevPage 68 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified