SOTAVerified

Benchmarking

Papers

Showing 41014125 of 5548 papers

TitleStatusHype
Benchmarking the Performance and Energy Efficiency of AI Accelerators for AI Training0
Performance Benchmarking of Psychomotor Skills Using Wearable Devices: An Application in Sport0
Performance Comparison of Surrogate-Assisted Evolutionary Algorithms on Computational Fluid Dynamics Problems0
Performance Evaluation Methodology for Long-Term Visual Object Tracking0
Benchmark Dataset for Pore-Scale CO2-Water Interaction0
TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations0
Performance Evaluation of Transcriptomics Data Normalization for Survival Risk Prediction0
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale0
Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding0
Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As0
Performance prediction of data streams on high-performance architecture0
Periocular Recognition in the Wild with Orthogonal Combination of Local Binary Coded Pattern in Dual-stream Convolutional Neural Network0
Which models are innately best at uncertainty estimation?0
PerMedCQA: Benchmarking Large Language Models on Medical Consumer Question Answering in Persian Language0
WeQA: A Benchmark for Retrieval Augmented Generation in Wind Energy Domain0
Perona: Robust Infrastructure Fingerprinting for Resource-Efficient Big Data Analytics0
PerSEval: Assessing Personalization in Text Summarizers0
A Conformance Checking-based Approach for Drift Detection in Business Processes0
Personalised Feedback Framework for Online Education Programmes Using Generative AI0
Benchmark Data Repositories for Better Benchmarking0
Personalized Multimodal Large Language Models: A Survey0
Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent0
Person Re-Identification by Unsupervised Video Matching0
Person Re-Identification in Identity Regression Space0
Person Re-identification in the Wild0
Show:102550
← PrevPage 165 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified