SOTAVerified

Benchmarking

Papers

Showing 39013925 of 5548 papers

TitleStatusHype
Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation0
Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection0
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable ConfidenceCode0
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit0
On Evaluation of Bangla Word Analogies0
ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented DataCode0
Benchmarking the Robustness of Quantized Models0
Probing Conceptual Understanding of Large Visual-Language ModelsCode0
Benchmarking Robustness to Text-Guided CorruptionsCode0
IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical SystemsCode0
DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images0
The Saudi Privacy Policy DatasetCode0
LogoNet: a fine-grained network for instance-level logo sketch retrievalCode0
OpenContrails: Benchmarking Contrail Detection on GOES-16 ABI0
A Latent Fingerprint in the Wild Database0
LaCViT: A Label-aware Contrastive Fine-tuning Framework for Vision TransformersCode0
Benchmarking FedAvg and FedCurv for Image Classification Tasks0
Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks0
Why is the winner the best?0
From Private to Public: Benchmarking GANs in the Context of Private Time Series Classification0
Open the box of digital neuromorphic processor: Towards effective algorithm-hardware co-design0
GeoNet: Benchmarking Unsupervised Adaptation across Geographies0
Hyperparameter optimization, quantum-assisted model performance prediction, and benchmarking of AI-based High Energy Physics workloads using HPC0
Exploring Continual Learning of Diffusion Models0
Show:102550
← PrevPage 157 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified