SOTAVerified

Benchmarking

Papers

Showing 46764700 of 5548 papers

TitleStatusHype
HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability predictionCode0
HATE-ITA: New Baselines for Hate Speech Detection in ItalianCode0
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applicationsCode0
Benchmarking White Blood Cell Classification Under Domain ShiftCode0
MAYA: Addressing Inconsistencies in Generative Password Guessing through a Unified BenchmarkCode0
Robust Benchmarking for Machine Learning of Clinical Entity ExtractionCode0
MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based AttacksCode0
Benchmarking Vision-Language Contrastive Methods for Medical Representation LearningCode0
A Wild Bootstrap for Degenerate Kernel TestsCode0
Harnessing Orthogonality to Train Low-Rank Neural NetworksCode0
Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary DropoutsCode0
Causally Testing Gender Bias in LLMs: A Case Study on Occupational BiasCode0
Benchmarking Unsupervised Strategies for Anomaly Detection in Multivariate Time SeriesCode0
Harmonization Benchmarking Tool for Neuroimaging DatasetsCode0
Adaptive Shrinkage Estimation For Personalized Deep Kernel Regression In Modeling Brain TrajectoriesCode0
Benchmarking Unsupervised Online IDS for Masquerade Attacks in CANCode0
The iToBoS dataset: skin region images extracted from 3D total body photographs for lesion detectionCode0
Benchmarking Ultra-High-Definition Image Reflection RemovalCode0
Understanding the Role of LLMs in Multimodal Evaluation BenchmarksCode0
VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction ModelsCode0
Measuring what Really Matters: Optimizing Neural Networks for TinyMLCode0
Benchmarking Traditional Machine Learning and Deep Learning Models for Fault Detection in Power TransformersCode0
Benchmarking TPU, GPU, and CPU Platforms for Deep LearningCode0
RoLargeSum: A Large Dialect-Aware Romanian News Dataset for Summary, Headline, and Keyword GenerationCode0
Hardware Aware Neural Network Architectures using FbNetCode0
Show:102550
← PrevPage 188 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified