SOTAVerified

Benchmarking

Papers

Showing 3851–3860 of 5548 papers

Title | Status | Hype
A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking | | 0
Needle In A Haystack, Fast: Benchmarking Image Perceptual Similarity Metrics At Scale | Code | 1
NEWTS: A Corpus for News Topic-Focused Summarization | | 0
Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems | | 0
AI-enabled Sound Pattern Recognition on Asthma Medication Adherence: Evaluation with the RDA Benchmark Suite | Code | 0
bsnsing: A decision tree induction method based on recursive optimal boolean rule composition | Code | 0
Benchmarking Unsupervised Anomaly Detection and Localization | | 0
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection | Code | 1
A Framework for Generating Informative Benchmark Instances | Code | 0
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified