SOTAVerified

Benchmarking

Papers

Showing 1651–1660 of 5548 papers

Title | Status | Hype
JExplore: Design Space Exploration Tool for Nvidia Jetson Boards | Code | 0
Benchmarking a transformer-FREE model for ad-hoc retrieval | Code | 0
Benchmarking Approximate Inference Methods for Neural Structured Prediction | Code | 0
JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models | Code | 0
Benchmarking Apache Spark and Hadoop MapReduce on Big Data Classification | Code | 0
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification | Code | 0
DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs | Code | 0
Benchmarking Jetson Edge Devices with an End-to-end Video-based Anomaly Detection System | Code | 0
Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs | Code | 0
JATE 2.0: Java Automatic Term Extraction with Apache Solr | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified