SOTAVerified

Benchmarking

Papers

Showing 26262650 of 5548 papers

TitleStatusHype
AI-ready Snow Radar Echogram Dataset (SRED) for climate change monitoring0
A Comprehensive Benchmarking Platform for Deep Generative Models in Molecular Design0
High Accuracy Tumor Diagnoses and Benchmarking of Hematoxylin and Eosin Stained Prostate Core Biopsy Images Generated by Explainable Deep Neural Networks0
Benchmarking Sample Selection Strategies for Batch Reinforcement Learning0
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects0
Fast Labeling and Transcription with the Speechalyzer Toolkit0
Benchmarking Quantum Hardware for Training of Fully Visible Boltzmann Machines0
FastEnsemble: Benchmarking and Accelerating Ensemble-based Uncertainty Estimation for Image-to-Image Translation0
Fast Empirical Scenarios0
Benchmarking Quantum Convolutional Neural Networks for Signal Classification in Simulated Gamma-Ray Burst Detection0
A Survey on Model Compression for Large Language Models0
FastDraft: How to Train Your Draft0
Forecasting NIFTY 50 benchmark Index using Seasonal ARIMA time series models0
AI-Powered Cow Detection in Complex Farm Environments0
Benchmarking quantized LLaMa-based models on the Brazilian Secondary School Exam0
Fast, approximate kinetics of RNA folding0
A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-190
Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems0
Formal Covariate Benchmarking to Bound Omitted Variable Bias0
Hiding in Plain Sight: Reframing Hardware Trojan Benchmarking as a Hide&Seek Modification0
Benchmarking Quality-Diversity Algorithms on Neuroevolution for Reinforcement Learning0
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents0
Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion Algorithms0
FarsBase-KBP: A Knowledge Base Population System for the Persian Knowledge Graph0
Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension0
Show:102550
← PrevPage 106 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified