SOTAVerified

Benchmarking

Papers

Showing 23512360 of 5548 papers

TitleStatusHype
Keras Sig: Efficient Path Signature Computation on GPU in Keras 30
Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings0
Benchmarking Graph Representations and Graph Neural Networks for Multivariate Time Series ClassificationCode0
Lessons From Red Teaming 100 Generative AI Products0
Stronger Than You Think: Benchmarking Weak Supervision on Realistic TasksCode0
Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles0
The Paradox of Success in Evolutionary and Bioinspired Optimization: Revisiting Critical Issues, Key Studies, and Methodological Pathways0
Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI0
Benchmarking YOLOv8 for Optimal Crack Detection in Civil Infrastructure0
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition0
Show:102550
← PrevPage 236 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified