Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2101–2125 of 5548 papers

Title	Date	Tasks	Status	Score
Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn	Jan 1, 2014	AutoMLBenchmarking	CodeCode Available	5
Hybrid Random Features	Oct 8, 2021	Benchmarking	CodeCode Available	5
Hyperbolic Benchmarking Unveils Network Topology-Feature Relationship in GNN Performance	Jun 4, 2024	BenchmarkingDrug Discovery	CodeCode Available	5
Hyperparameter-Free Losses for Model-Based Monocular Reconstruction	Aug 16, 2019	3D ReconstructionBenchmarking	CodeCode Available	5
Benchmarking Suite for Synthetic Aperture Radar Imagery Anomaly Detection (SARIAD) Algorithms	Apr 10, 2025	Anomaly DetectionBenchmarking	CodeCode Available	5
HuSc3D: Human Sculpture dataset for 3D object reconstruction	Jun 9, 2025	3D Object Reconstruction3D Reconstruction	CodeCode Available	5
Benchmarking Subset Selection from Large Candidate Solution Sets in Evolutionary Multi-objective Optimization	Jan 18, 2022	Benchmarking	CodeCode Available	5
A*3D Dataset: Towards Autonomous Driving in Challenging Environments	Sep 17, 2019	3D Object DetectionAutonomous Driving	CodeCode Available	5
Hybrid Machine Learning Models of Classifying Residential Requests for Smart Dispatching	Dec 22, 2019	BenchmarkingBIG-bench Machine Learning	CodeCode Available	5
Identifying the Smallest Adversarial Load Perturbations that Render DC-OPF Infeasible	Jul 10, 2025	Adversarial AttackBenchmarking	CodeCode Available	5
HRIBench: Benchmarking Vision-Language Models for Real-Time Human Perception in Human-Robot Interaction	Jun 25, 2025	BenchmarkingPerson Identification	CodeCode Available	5
Benchmarking Spurious Bias in Few-Shot Image Classifiers	Sep 4, 2024	AttributeBenchmarking	CodeCode Available	5
HRNET: AI on Edge for mask detection and social distancing	Nov 30, 2021	BenchmarkingEdge-computing	CodeCode Available	5
A Comprehensive Summarization and Evaluation of Feature Refinement Modules for CTR Prediction	Nov 8, 2023	BenchmarkingClick-Through Rate Prediction	CodeCode Available	5
Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and Challenges	May 16, 2025	BenchmarkingState Estimation	CodeCode Available	5
Audio Explanation Synthesis with Generative Foundation Models	Oct 10, 2024	BenchmarkingDecision Making	CodeCode Available	5
How to Manage Tiny Machine Learning at Scale: An Industrial Perspective	Feb 18, 2022	BenchmarkingBIG-bench Machine Learning	CodeCode Available	5
HR-VILAGE-3K3M: A Human Respiratory Viral Immunization Longitudinal Gene Expression Dataset for Systems Immunity	May 19, 2025	Benchmarkingfeature selection	CodeCode Available	5
How Far Are We from Optimal Reasoning Efficiency?	Jun 8, 2025	16kBenchmarking	CodeCode Available	5
Towards Segment Anything Model (SAM) for Medical Image Segmentation: A Survey	May 5, 2023	BenchmarkingImage Generation	CodeCode Available	5
HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios	Jun 11, 2025	Action RecognitionAction Segmentation	CodeCode Available	5
Benchmarking Single Image Dehazing and Beyond	Dec 12, 2017	BenchmarkingImage Dehazing	CodeCode Available	5
HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models	Jun 4, 2025	BenchmarkingGeneral Knowledge	CodeCode Available	5
Benchmarking Sequential Visual Input Reasoning and Prediction in Multimodal Large Language Models	Oct 20, 2023	Activity PredictionBenchmarking	CodeCode Available	5
Benchmarking sentiment analysis methods for large-scale texts: A case for using continuum-scored words and word shift graphs	Dec 2, 2015	BenchmarkingSentiment Analysis	CodeCode Available	5

Show:10 25 50

← PrevPage 85 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified