Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4276–4300 of 5548 papers

Title	Date	Tasks	Status
AutoLay: Benchmarking amodal layout estimation for autonomous driving	Aug 20, 2021	Amodal Layout EstimationAutonomous Driving	—Unverified
Pythae: Unifying Generative Autoencoders in Python -- A Benchmarking Use Case	Jun 16, 2022	BenchmarkingDensity Estimation	—Unverified
Python Random Graph Generator	Sep 20, 2017	BenchmarkingGraph Generation	—Unverified
Q2SAR: A Quantum Multiple Kernel Learning Approach for Drug Discovery	Jun 17, 2025	BenchmarkingDrug Discovery	—Unverified
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs	Sep 30, 2024	BenchmarkingMultiple-choice	—Unverified
AutoAI-TS: AutoAI for Time Series Forecasting	Feb 24, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
QDA^2: A principled approach to automatically annotating charge stability diagrams	Dec 18, 2023	Benchmarking	—Unverified
A Universal Protocol to Benchmark Camera Calibration for Sports	Apr 15, 2024	BenchmarkingCamera Calibration	—Unverified
A Unified Taylor Framework for Revisiting Attribution Methods	Aug 21, 2020	BenchmarkingDecision Making	—Unverified
A Complementarity Analysis of the COCO Benchmark Problems and Artificially Generated Problems	Apr 27, 2021	Benchmarking	—Unverified
QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges	Jun 24, 2025	BenchmarkingCode Generation	—Unverified
A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation	Nov 9, 2016	BenchmarkingTranslation	—Unverified
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning	Aug 20, 2024	BenchmarkingLanguage Modelling	—Unverified
QSAM-Net: Rain streak removal by quaternion neural network with self-attention module	Aug 8, 2022	Benchmarkingobject-detection	—Unverified
Decoding Intelligence: A Framework for Certifying Knowledge Comprehension in LLMs	Feb 24, 2024	BenchmarkingKnowledge Graphs	—Unverified
QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation	May 8, 2025	BenchmarkingFederated Learning	—Unverified
Unbounded Bayesian Optimization via Regularization	Aug 14, 2015	Bayesian OptimizationBenchmarking	—Unverified
Qualitative Insights Tool (QualIT): LLM Enhanced Topic Modeling	Sep 24, 2024	ArticlesBenchmarking	—Unverified
Quality Assessment of Low Light Restored Images: A Subjective Study and an Unsupervised Model	Feb 4, 2022	BenchmarkingContrastive Learning	—Unverified
Quality Assured: Rethinking Annotation Strategies in Imaging AI	Jul 24, 2024	Benchmarking	—Unverified
Quality at the Tail of Machine Learning Inference	Dec 25, 2022	Autonomous DrivingBenchmarking	—Unverified
Uncertainty estimation for Cross-dataset performance in Trajectory prediction	May 15, 2022	BenchmarkingPrediction	—Unverified
A Unified Study of Machine Learning Explanation Evaluation Metrics	Mar 27, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified
QuantBench: Benchmarking AI Methods for Quantitative Investment	Apr 24, 2025	BenchmarkingContinual Learning	—Unverified
Uncertainty Estimation with Deep Learning for Rainfall-Runoff Modelling	Dec 15, 2020	BenchmarkingDeep Learning	—Unverified

Show:10 25 50

← PrevPage 172 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified