Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4301–4325 of 5548 papers

Title	Date	Tasks	Status
Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI	Jan 13, 2025	ARCBenchmarking	—Unverified
Quantifying Social Biases Using Templates is Unreliable	Oct 9, 2022	AttributeBenchmarking	—Unverified
Quantifying the Complexity of Standard Benchmarking Datasets for Long-Term Human Trajectory Prediction	May 28, 2020	BenchmarkingPrediction	—Unverified
Quantifying the Impact of Boundary Constraint Handling Methods on Differential Evolution	May 14, 2021	Benchmarking	—Unverified
A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification	Feb 14, 2020	BenchmarkingClassification	—Unverified
Quantitative Benchmarking of Anomaly Detection Methods in Digital Pathology	Jun 24, 2025	Anomaly DetectionArtifact Detection	—Unverified
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking	May 26, 2025	BenchmarkingOptical Flow Estimation	—Unverified
Quantitative evaluation of brain-inspired vision sensors in high-speed robotic perception	Apr 27, 2025	BenchmarkingEvent-based vision	—Unverified
A Unified Framework for Provably Efficient Algorithms to Estimate Shapley Values	Jun 5, 2025	Benchmarking	—Unverified
Understanding Foundation Models: Are We Back in 1924?	Sep 11, 2024	Benchmarking	—Unverified
Quantitative Metrics for Benchmarking Medical Image Harmonization	Feb 6, 2024	AnatomyBenchmarking	—Unverified
Benchmarking Bayesian neural networks and evaluation metrics for regression tasks	Jun 8, 2022	BenchmarkingOpen-Ended Question Answering	—Unverified
A Unified Framework and Dataset for Assessing Societal Bias in Vision-Language Models	Feb 21, 2024	BenchmarkingImage to text	—Unverified
Quantum-Assisted Learning of Hardware-Embedded Probabilistic Graphical Models	Sep 8, 2016	BenchmarkingBIG-bench Machine Learning	—Unverified
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems	Oct 11, 2022	BenchmarkingRecommendation Systems	—Unverified
Quantum classification of the MNIST dataset with Slow Feature Analysis	May 22, 2018	BenchmarkingClassification	—Unverified
Quantum Cognitively Motivated Decision Fusion for Video Sentiment Analysis	Jan 12, 2021	BenchmarkingDecision Making	—Unverified
A Comparison of Directional Distances for Hand Pose Estimation	Apr 3, 2017	BenchmarkingHand Pose Estimation	—Unverified
Quantum Kernel Methods under Scrutiny: A Benchmarking Study	Sep 6, 2024	BenchmarkingQuantum Machine Learning	—Unverified
Quantum Long Short-Term Memory (QLSTM) vs Classical LSTM in Time Series Forecasting: A Comparative Study in Solar Power Forecasting	Oct 25, 2023	BenchmarkingHyperparameter Optimization	—Unverified
Quantum Kernel Learning for Small Dataset Modeling in Semiconductor Fabrication: Application to Ohmic Contact	Sep 17, 2024	BenchmarkingQuantum Machine Learning	—Unverified
Quantum-tunnelling deep neural network for optical illusion recognition	Jun 26, 2024	Autonomous VehiclesBenchmarking	—Unverified
QuArch: A Question-Answering Dataset for AI Agents in Computer Architecture	Jan 3, 2025	BenchmarkingQuestion Answering	—Unverified
Stereotype Detection in LLMs: A Multiclass, Explainable, and Benchmark-Driven Approach	Apr 2, 2024	BenchmarkingCommon Sense Reasoning	—Unverified
Understanding Recurrent Neural Architectures by Analyzing and Synthesizing Long Distance Dependencies in Benchmark Sequential Datasets	Oct 6, 2018	BenchmarkingLanguage Modeling	—Unverified

Show:10 25 50

← PrevPage 173 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified