SOTAVerified

Benchmarking

Papers

Showing 1501-1525 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Benchmarking GPUs on SVBRDF Extractor Model | | 0 |
| Benchmarking GPU and TPU Performance with Graph Neural Networks | | 0 |
| CubeSat-Enabled Free-Space Optics: Joint Data Communication and Fine Beam Tracking | | 0 |
| Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies | | 0 |
| Approaches for benchmarking single-cell gene regulatory network inference methods | | 0 |
| Applying Standards to Advance Upstream & Downstream Ethics in Large Language Models | | 0 |
| Benchmarking GNNs Using Lightning Network Data | | 0 |
| Benchmarking global optimization techniques for unmanned aerial vehicle path planning | | 0 |
| Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data | | 0 |
| Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming | | 0 |
| Applications in CityLearn Gym Environment for Multi-Objective Control Benchmarking in Grid-Interactive Buildings and Districts | | 0 |
| AEON: Adaptive Estimation of Instance-Dependent In-Distribution and Out-of-Distribution Label Noise for Robust Learning | | 0 |
| CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models | | 0 |
| Benchmarking Generative AI for Scoring Medical Student Interviews in Objective Structured Clinical Examinations (OSCEs) | | 0 |
| Application of Machine Learning for Online Reputation Systems | | 0 |
| Benchmarking General-Purpose In-Context Learning | | 0 |
| Application of DEA in International Market Selection for the export of products from Spain | | 0 |
| CUB: Benchmarking Context Utilisation Techniques for Language Models | | 0 |
| CULEMO: Cultural Lenses on Emotion -- Benchmarking LLMs for Cross-Cultural Emotion Understanding | | 0 |
| Application Inference using Machine Learning based Side Channel Analysis | | 0 |
| Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech | | 0 |
| Application based Evaluation of an Efficient Spike-Encoder, "Spiketrum" | | 0 |
| Benchmarking Foundation Models with Language-Model-as-an-Examiner | | 0 |
| Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design | | 0 |
| Apples to Apples: Learning Semantics of Common Entities Through a Novel Comprehension Task | | 0 |
Page 61 of 222

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |