SOTAVerified

Benchmarking

Papers

Showing 4301–4325 of 5548 papers

Title | Status | Hype
Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | | 0
Quantifying Social Biases Using Templates is Unreliable | | 0
Quantifying the Complexity of Standard Benchmarking Datasets for Long-Term Human Trajectory Prediction | | 0
Quantifying the Impact of Boundary Constraint Handling Methods on Differential Evolution | | 0
A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification | | 0
Quantitative Benchmarking of Anomaly Detection Methods in Digital Pathology | | 0
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking | | 0
Quantitative evaluation of brain-inspired vision sensors in high-speed robotic perception | | 0
A Unified Framework for Provably Efficient Algorithms to Estimate Shapley Values | | 0
Understanding Foundation Models: Are We Back in 1924? | | 0
Quantitative Metrics for Benchmarking Medical Image Harmonization | | 0
Benchmarking Bayesian neural networks and evaluation metrics for regression tasks | | 0
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models | | 0
Quantum-Assisted Learning of Hardware-Embedded Probabilistic Graphical Models | | 0
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems | | 0
Quantum classification of the MNIST dataset with Slow Feature Analysis | | 0
Quantum Cognitively Motivated Decision Fusion for Video Sentiment Analysis | | 0
A Comparison of Directional Distances for Hand Pose Estimation | | 0
Quantum Kernel Methods under Scrutiny: A Benchmarking Study | | 0
Quantum Long Short-Term Memory (QLSTM) vs Classical LSTM in Time Series Forecasting: A Comparative Study in Solar Power Forecasting | | 0
Quantum Kernel Learning for Small Dataset Modeling in Semiconductor Fabrication: Application to Ohmic Contact | | 0
Quantum-tunnelling deep neural network for optical illusion recognition | | 0
QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture | | 0
Stereotype Detection in LLMs: A Multiclass, Explainable, and Benchmark-Driven Approach | | 0
Understanding Recurrent Neural Architectures by Analyzing and Synthesizing Long Distance Dependencies in Benchmark Sequential Datasets | | 0
Page 173 of 222

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified