Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3001–3025 of 5548 papers

Title	Date	Tasks	Status
Benchmarking the Gerchberg-Saxton Algorithm	May 18, 2020	Benchmarking	—Unverified
Benchmarking the Fidelity and Utility of Synthetic Relational Data	Oct 4, 2024	BenchmarkingFeature Importance	—Unverified
Benchmarking the Extraction and Disambiguation of Named Entities on the Semantic Web	May 1, 2014	BenchmarkingEntity Linking	—Unverified
ImageNet performance correlates with pose estimation robustness and generalization on out-of-domain data	Jul 17, 2020	Animal Pose EstimationBenchmarking	—Unverified
ImagePairs: Realistic Super Resolution Dataset via Beam Splitter Camera Rig	Apr 18, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified
Imagining and building wise machines: The centrality of AI metacognition	Nov 4, 2024	BenchmarkingNavigate	—Unverified
Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans	Jul 15, 2023	BenchmarkingDimensionality Reduction	—Unverified
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World	Dec 5, 2023	BenchmarkingDiversity	—Unverified
Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking	Mar 1, 2024	BenchmarkingImitation Learning	—Unverified
Imitation Learning from Pixel Observations for Continuous Control	Sep 29, 2021	Benchmarkingcontinuous-control	—Unverified
Practical Guidelines for Cell Segmentation Models Under Optical Aberrations in Microscopy	Apr 12, 2024	BenchmarkingCell Segmentation	—Unverified
A Functional Analysis Approach to Symbolic Regression	Feb 9, 2024	Benchmarkingregression	—Unverified
Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors	Aug 15, 2024	BenchmarkingManagement	—Unverified
A Framework for Large Scale Synthetic Graph Dataset Generation	Oct 4, 2022	BenchmarkingDataset Generation	—Unverified
Dataset Properties Shape the Success of Neuroimaging-Based Patient Stratification: A Benchmarking Analysis Across Clustering Algorithms	Mar 15, 2025	BenchmarkingBrain Morphometry	—Unverified
A Framework for Evaluating Predictive Models Using Synthetic Image Covariates and Longitudinal Data	Oct 21, 2024	Benchmarking	—Unverified
Impact of spatial transformations on landscape features of CEC2022 basic benchmark problems	Feb 12, 2024	Benchmarking	—Unverified
Implementing and Benchmarking the Locally Competitive Algorithm on the Loihi 2 Neuromorphic Processor	Jul 25, 2023	BenchmarkingCPU	—Unverified
Implementing hosting capacity analysis in distribution networks: Practical considerations, advancements and future directions	Dec 11, 2023	BenchmarkingCapacity Estimation	—Unverified
Benchmarking the Benchmark -- Analysis of Synthetic NIDS Datasets	Apr 19, 2021	BenchmarkingIntrusion Detection	—Unverified
Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities	Jan 22, 2025	BenchmarkingReferring Expression	—Unverified
Benchmarking the Accuracy and Robustness of Feedback Alignment Algorithms	Aug 30, 2021	Benchmarking	—Unverified
Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels	Oct 5, 2024	Benchmarking	—Unverified
The Moral Mind(s) of Large Language Models	Nov 19, 2024	BenchmarkingDecision Making	—Unverified
Benchmarking Test-Time Unsupervised Deep Neural Network Adaptation on Edge Devices	Mar 21, 2022	BenchmarkingGPU	—Unverified

Show:10 25 50

← PrevPage 121 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified