SOTAVerified

Benchmarking

Papers

Showing 28762900 of 5548 papers

TitleStatusHype
Benchmarking Vision Language Models on German Factual Data0
The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation0
Jointly Modeling and Clustering Tensors in High Dimensions0
Heterogeneous graph neural networks for species distribution modeling0
Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems0
Hiding in Plain Sight: Reframing Hardware Trojan Benchmarking as a Hide&Seek Modification0
Agentic Mixture-of-Workflows for Multi-Modal Chemical Search0
Benchmarking Vision Language Models for Cultural Understanding0
Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce0
AA3DNet: Attention Augmented Real Time 3D Object Detection0
High Accuracy Tumor Diagnoses and Benchmarking of Hematoxylin and Eosin Stained Prostate Core Biopsy Images Generated by Explainable Deep Neural Networks0
Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals0
High Fidelity RF Clutter Modeling and Simulation0
High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing0
Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving0
The EuroCity Persons Dataset: A Novel Benchmark for Object Detection0
The Evolutionary Computation Methods No One Should Use0
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects0
Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments0
Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation0
Benchmarking Video Frame Interpolation0
SnCQA: A hardware-efficient equivariant quantum convolutional circuit architecture0
HLB: Benchmarking LLMs' Humanlikeness in Language Use0
Benchmarking Unsupervised Outlier Detection with Realistic Synthetic Data0
The Expressive Power of Word Embeddings0
Show:102550
← PrevPage 116 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified