SOTAVerified

Benchmarking

Papers

Showing 34763500 of 5548 papers

TitleStatusHype
Benchmarking Large Language Models for News SummarizationCode1
Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST)0
Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022Code0
Benchmarking Robustness to Adversarial Image ObfuscationsCode1
Benchmarking optimality of time series classification methods in distinguishing diffusionsCode0
Cross-Subject Deep Transfer Models for Evoked Potentials in Brain-Computer Interface0
Heterogeneous Datasets for Federated Survival Analysis SimulationCode0
Quality Indicators for Preference-based Evolutionary Multi-objective Optimization Using a Reference Point: A Review and AnalysisCode0
TemporAI: Facilitating Machine Learning Innovation in Time Domain Tasks for MedicineCode1
Task-Agnostic Graph Neural Network Evaluation via Adversarial CollaborationCode0
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement LearningCode3
A Systematic Review of Green AICode0
BiBench: Benchmarking and Analyzing Network BinarizationCode1
Out of Distribution Performance of State of Art Vision Model0
Towards Robust Metrics for Concept Representation EvaluationCode0
SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain0
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applicationsCode0
Job recommendations: benchmarking of collaborative filtering methods for classifieds0
Vision Learners Meet Web Image-Text Pairs0
Hawk: An Industrial-strength Multi-label Document Classifier0
Desbordante: from benchmarking suite to high-performance science-intensive data profiler (preprint)Code2
Young Labeled Faces in the Wild (YLFW): A Dataset for Children Faces RecognitionCode1
Evaluating the Transferability of Machine-Learned Force Fields for Material Property ModelingCode0
Critical review of conformational B-cell epitope prediction methodsCode0
Benchmarking Robustness in Neural Radiance Fields0
Show:102550
← PrevPage 140 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified