Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2951–3000 of 5548 papers

Title	Date	Tasks	Status
Hybrid Precoder and Combiner Designs for Decentralized Parameter Estimation in mmWave MIMO Wireless Sensor Networks	Jun 25, 2023	Benchmarkingparameter estimation	—Unverified
Hybrid Quantum Computing -- Tabu Search Algorithm for Partitioning Problems: preliminary study on the Traveling Salesman Problem	Dec 9, 2020	BenchmarkingTraveling Salesman Problem	—Unverified
The Interactive Effects of Operators and Parameters to GA Performance Under Different Problem Sizes	Aug 1, 2015	Benchmarking	—Unverified
Hybrid Transceiver Design for Tera-Hertz MIMO Systems Relying on Bayesian Learning Aided Sparse Channel Estimation	Sep 20, 2021	Benchmarking	—Unverified
Hydra: Marker-Free RGB-D Hand-Eye Calibration	Apr 29, 2025	Benchmarking	—Unverified
Hydrological time series forecasting using simple combinations: Big data testing and investigations on one-year ahead river flow predictability	Jan 2, 2020	BenchmarkingManagement	—Unverified
Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions	Apr 2, 2025	BenchmarkingSegmentation	—Unverified
Hyperbolic Anomaly Detection	Jan 1, 2024	Anomaly DetectionBenchmarking	—Unverified
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning	Mar 14, 2025	BenchmarkingRelational Reasoning	—Unverified
HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere	Nov 13, 2024	BenchmarkingDataset Generation	—Unverified
Hypergraph Neural Networks through the Lens of Message Passing: A Common Perspective to Homophily and Architecture Design	Oct 11, 2023	BenchmarkingRepresentation Learning	—Unverified
The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine	Sep 12, 2024	Autonomous DrivingBenchmarking	—Unverified
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out	Dec 24, 2024	BenchmarkingDeep Learning	—Unverified
Benchmarking the Sim-to-Real Gap in Cloth Manipulation	Oct 14, 2023	BenchmarkingMuJoCo	—Unverified
Hyperparameter optimization, quantum-assisted model performance prediction, and benchmarking of AI-based High Energy Physics workloads using HPC	Mar 27, 2023	BenchmarkingHyperparameter Optimization	—Unverified
Hyperspectral Anomaly Detection Methods: A Survey and Comparative Study	Jul 8, 2025	Anomaly DetectionBenchmarking	—Unverified
v-SVR Polynomial Kernel for Predicting the Defect Density in New Software Projects	Dec 15, 2018	Benchmarkingregression	—Unverified
Benchmarking the Robustness of Semantic Segmentation Models	Aug 14, 2019	Autonomous DrivingBenchmarking	—Unverified
The Karp Dataset	Jan 24, 2025	BenchmarkingMathematical Reasoning	—Unverified
HySpecNet-11k: A Large-Scale Hyperspectral Dataset for Benchmarking Learning-Based Hyperspectral Image Compression Methods	Jun 1, 2023	BenchmarkingHyperspectral image analysis	—Unverified
Benchmarking the Robustness of Quantized Models	Apr 8, 2023	BenchmarkingQuantization	—Unverified
Vulnerability of Face Morphing Attacks: A Case Study on Lookalike and Identical Twins	Mar 24, 2023	BenchmarkingFace Recognition	—Unverified
Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference	May 19, 2025	BenchmarkingCausal Inference	—Unverified
ICE-ID: A Novel Historical Census Data Benchmark Comparing NARS against LLMs, \& a ML Ensemble on Longitudinal Identity Resolution	Jun 11, 2025	Benchmarking	—Unverified
ICON^2: Reliably Benchmarking Predictive Inequity in Object Detection	Jun 7, 2023	AttributeAutonomous Driving	—Unverified
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving	Feb 23, 2024	BenchmarkingDecision Making	—Unverified
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs	Oct 2, 2024	BenchmarkingHallucination	—Unverified
Identifiable Convex-Concave Regression via Sub-gradient Regularised Least Squares	Jun 22, 2025	Benchmarkingregression	—Unverified
Identification of vortex in unstructured mesh with graph neural networks	Nov 11, 2023	BenchmarkingGraph Generation	—Unverified
The Leaderboard Illusion	Apr 29, 2025	BenchmarkingChatbot	—Unverified
XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems	Nov 10, 2016	Benchmarking	—Unverified
Identifying patterns and recommendations of and for sustainable open data initiatives: a benchmarking-driven analysis of open government data initiatives among European countries	Dec 1, 2023	Benchmarking	—Unverified
Identifying the Context Shift between Test Benchmarks and Production Data	Jul 3, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified
The Liouville Generator for Producing Integrable Expressions	Jun 17, 2024	Benchmarking	—Unverified
Benchmarking the Robustness of Instance Segmentation Models	Sep 2, 2021	BenchmarkingDomain Adaptation	—Unverified
Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance	Mar 23, 2023	BenchmarkingData Augmentation	—Unverified
IEA: Inner Ensemble Average within a convolutional neural network	Aug 30, 2018	BenchmarkingEnsemble Learning	—Unverified
Benchmarking the rationality of AI decision making using the transitivity axiom	Feb 14, 2025	BenchmarkingDecision Making	—Unverified
A Gap in Time: The Challenge of Processing Heterogeneous IoT Data in Digitalized Buildings	May 23, 2024	BenchmarkingData Integration	—Unverified
Exploring the Decentraland Economy: Multifaceted Parcel Attributes, Key Insights, and Benchmarking	Apr 11, 2024	AttributeBenchmarking	—Unverified
A2Perf: Real-World Autonomous Agents Benchmark	Mar 4, 2025	BenchmarkingCombinatorial Optimization	—Unverified
Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection	Apr 11, 2023	Adversarial AttackAdversarial Robustness	—Unverified
Benchmarking the Neural Linear Model for Regression	Dec 18, 2019	Bayesian OptimizationBenchmarking	—Unverified
The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies	Aug 16, 2022	BenchmarkingManagement	—Unverified
From Attack to Protection: Leveraging Watermarking Attack Network for Advanced Add-on Watermarking	Aug 14, 2020	Benchmarking	—Unverified
Image2Struct: Benchmarking Structure Extraction for Vision-Language Models	Oct 29, 2024	Benchmarking	—Unverified
Image-Based Benchmarking and Visualization for Large-Scale Global Optimization	Jul 24, 2020	BenchmarkingDimensionality Reduction	—Unverified
Benchmarking the Impact of Noise on Deep Learning-based Classification of Atrial Fibrillation in 12-Lead ECG	Mar 24, 2023	Atrial Fibrillation DetectionBenchmarking	—Unverified
Benchmarking the human brain against computational architectures	May 15, 2023	BenchmarkingComputational Efficiency	—Unverified
Image Matching: An Application-oriented Benchmark	Sep 12, 2017	AttributeBenchmarking	—Unverified

Show:10 25 50

← PrevPage 60 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified