SOTAVerified

Benchmarking

Papers

Showing 49414950 of 5548 papers

TitleStatusHype
ExEBench: Benchmarking Foundation Models on Extreme Earth EventsCode0
MULTITAT: Benchmarking Multilingual Table-and-Text Question AnsweringCode0
Evolving Evolutionary Algorithms with PatternsCode0
Semantic Hilbert Space for Text Representation LearningCode0
A Continuous Information Gain Measure to Find the Most Discriminatory Problems for AI BenchmarkingCode0
Timage -- A Robust Time Series Classification PipelineCode0
AttackNet: Enhancing Biometric Security via Tailored Convolutional Neural Network Architectures for Liveness DetectionCode0
EvoLearner: Learning Description Logics with Evolutionary AlgorithmsCode0
Evidential Deep Learning for Uncertainty Quantification and Out-of-Distribution Detection in Jet Identification using Deep Neural NetworksCode0
Integrating Large Language Models and Knowledge Graphs for Extraction and Validation of Textual Test DataCode0
Show:102550
← PrevPage 495 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified