Benchmarking

Papers

Showing 4901–4925 of 5548 papers

Title	Date	Tasks	Status
Fast Benchmarking of Asynchronous Multi-Fidelity Optimization on Zero-Cost Benchmarks	Mar 4, 2024	Benchmarking	Code Available
Benchmarking Post-Hoc Interpretability Approaches for Transformer-based Misogyny Detection	May 1, 2022	Benchmarking, Hate Speech Detection	Code Available
Fast Benchmarking of Accuracy vs. Training Time with Cyclic Learning Rates	Jun 2, 2022	Benchmarking	Code Available
Benchmarking Positional Encodings for GNNs and Graph Transformers	Nov 19, 2024	Benchmarking	Code Available
Fast and accurate alignment of long bisulfite-seq reads	Jan 6, 2014	Benchmarking	Code Available
Benchmarking Popular Classification Models' Robustness to Random and Targeted Corruptions	Jan 31, 2020	Benchmarking, Classification	Code Available
False Promises in Medical Imaging AI? Assessing Validity of Outperformance Claims	May 7, 2025	Benchmarking	Code Available
Benchmarking Perturbation-based Saliency Maps for Explaining Atari Agents	Jan 18, 2021	Atari Games, Benchmarking	Code Available
Unsupervised Anomaly Detection in Multivariate Time Series across Heterogeneous Domains	Mar 29, 2025	Anomaly Detection, Benchmarking	Code Available
Benchmarking person re-identification datasets and approaches for practical real-world implementations	Dec 20, 2022	Benchmarking, Pedestrian Detection	Code Available
FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNs	Dec 27, 2023	Benchmarking, GPU	Code Available
FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainability	Jun 20, 2024	Benchmarking, Fairness	Code Available
Authentic Emotion Mapping: Benchmarking Facial Expressions in Real News	Apr 21, 2024	Benchmarking, Emotion Recognition	Code Available
Benchmarking performance of object detection under image distortions in an uncontrolled environment	Oct 28, 2022	Benchmarking, Object	Code Available
GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal Segmentation	Dec 12, 2021	Benchmarking, Instance Segmentation	Code Available
Multimodal Benchmarking and Recommendation of Text-to-Image Generation Models	May 6, 2025	Benchmarking, Image Generation	Code Available
Segmenting France Across Four Centuries	May 30, 2025	Benchmarking, Image-to-Image Translation	Code Available
Audio Explanation Synthesis with Generative Foundation Models	Oct 10, 2024	Benchmarking, Decision Making	Code Available
Benchmarking Tropical Cyclone Rapid Intensification with Satellite Images and Attention-based Deep Models	Sep 25, 2019	Benchmarking, Deep Learning	Code Available
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes	Jun 3, 2025	Benchmarking, Feature Engineering	Code Available
Can LLMs perform structured graph reasoning?	Feb 2, 2024	Benchmarking, Navigate	Code Available
Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object Detectors	Mar 14, 2024	Benchmarking, Domain Adaptation	Code Available
Exploring Model-based Planning with Policy Networks	Jun 20, 2019	Benchmarking, model	Code Available
Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and Benchmark	Jun 30, 2021	Benchmarking, Prediction	Code Available
Multimodal Multi-User Surface Recognition with the Kernel Two-Sample Test	Mar 8, 2023	Benchmarking, Time Series	Code Available

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified