SOTAVerified

Benchmarking

Papers

Showing 36213630 of 5548 papers

TitleStatusHype
Benchmarking performance of object detection under image distortions in an uncontrolled environmentCode0
Benchmarking Language Models for Code Syntax UnderstandingCode1
What's Different between Visual Question Answering for Machine "Understanding" Versus for Accessibility?Code0
pmuBAGE: The Benchmarking Assortment of Generated PMU Data for Power System EventsCode0
CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and SummarizationCode0
A Comparative Attention Framework for Better Few-Shot Object Detection on Aerial ImagesCode1
Deep Crowd Anomaly Detection: State-of-the-Art, Challenges, and Future Research Directions0
What cleaves? Is proteasomal cleavage prediction reaching a ceiling?0
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural NetworksCode1
Show:102550
← PrevPage 363 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified