Benchmarking

Papers

Showing 4051–4075 of 5548 papers

Title	Date	Tasks	Status
Syn3DWound: A Synthetic Dataset for 3D Wound Bed Analysis	Nov 27, 2023	Benchmarking, Diagnostic	Unverified
SynBench: Task-Agnostic Benchmarking of Pretrained Representations using Synthetic Data	Oct 6, 2022	Benchmarking, Representation Learning	Unverified
Synplex: A synthetic simulator of highly multiplexed histological images	Mar 8, 2021	Benchmarking	Unverified
Syntactically Aware Neural Architectures for Definition Extraction	Jun 1, 2018	Benchmarking, Binary Classification	Unverified
Syntax Encoding with Application in Authorship Attribution	Oct 1, 2018	Authorship Attribution, Benchmarking	Unverified
A Synthetic Benchmarking Pipeline to Compare Camera Calibration Algorithms	Jul 3, 2023	Benchmarking, Camera Calibration	Unverified
Synthetic Video Generation for Robust Hand Gesture Recognition in Augmented Reality Applications	Nov 4, 2019	Benchmarking, Gesture Recognition	Unverified
Synthetic weather radar using hybrid quantum-classical machine learning	Nov 30, 2021	Benchmarking, BIG-bench Machine Learning	Unverified
SynthRAD2025 Grand Challenge dataset: generating synthetic CTs for radiotherapy	Feb 24, 2025	Benchmarking, Image Generation	Unverified
SysML'19 demo: customizable and reusable Collective Knowledge pipelines to automate and reproduce machine learning experiments	Mar 31, 2019	Benchmarking, BIG-bench Machine Learning	Unverified
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency	Jul 1, 2023	Benchmarking, Data Augmentation	Unverified
Systematic Comparison of Path Planning Algorithms using PathBench	Mar 7, 2022	Benchmarking	Unverified
Systematic Review: Anomaly Detection in Connected and Autonomous Vehicles	May 4, 2024	Anomaly Detection, Articles	Unverified
SzCORE as a benchmark: report from the seizure detection challenge at the 2025 AI in Epilepsy and Neurological Disorders Conference	May 19, 2025	Benchmarking, EEG	Unverified
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts	Dec 5, 2024	Benchmarking, Image Generation	Unverified
T^2K^2: The Twitter Top-K Keywords Benchmark	Sep 14, 2017	Benchmarking, Information Retrieval	Unverified
TabKAN: Advancing Tabular Data Analysis using Kolmogorov-Arnold Network	Apr 9, 2025	Benchmarking, Deep Learning	Unverified
TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer	Jan 2, 2025	Benchmarking, Quantization	Unverified
TabularQGAN: A Quantum Generative Model for Tabular Data	May 28, 2025	Benchmarking, Generative Adversarial Network	Unverified
Tackling the Story Ending Biases in The Story Cloze Test	Jul 1, 2018	Benchmarking, Cloze Test	Unverified
Tackling Visual Control via Multi-View Exploration Maximization	Nov 28, 2022	Benchmarking, Reinforcement Learning (RL)	Unverified
TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding	Jan 16, 2024	Action Recognition, Benchmarking	Unverified
Tactile MNIST: Benchmarking Active Tactile Perception	Jun 3, 2025	Benchmarking, Scene Understanding	Unverified
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics	Mar 3, 2025	Benchmarking, Spoken Dialogue Systems	Unverified
TARGET: Benchmarking Table Retrieval for Generative Tasks	May 14, 2025	Benchmarking, Representation Learning	Unverified


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified