Benchmarking

Papers

Showing 2876–2900 of 5548 papers

Title	Date	Tasks	Status
Benchmarking Vision Language Models on German Factual Data	Apr 15, 2025	Benchmarking	Unverified
The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation	Aug 1, 2021	Benchmarking, Machine Translation	Unverified
Jointly Modeling and Clustering Tensors in High Dimensions	Apr 15, 2021	Benchmarking, Clustering	Unverified
Heterogeneous graph neural networks for species distribution modeling	Mar 14, 2025	Benchmarking	Unverified
Hide and Seek: on the Stealthiness of Attacks against Deep Learning Systems	May 31, 2022	Benchmarking	Unverified
Hiding in Plain Sight: Reframing Hardware Trojan Benchmarking as a Hide&Seek Modification	Oct 21, 2024	Benchmarking	Unverified
Agentic Mixture-of-Workflows for Multi-Modal Chemical Search	Feb 26, 2025	Benchmarking, Retrieval	Unverified
Benchmarking Vision Language Models for Cultural Understanding	Jul 15, 2024	Benchmarking, Question Answering	Unverified
Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce	Oct 28, 2024	Benchmarking, graph construction	Unverified
AA3DNet: Attention Augmented Real Time 3D Object Detection	Jul 26, 2021	3D Object Detection, Autonomous Vehicles	Unverified
High Accuracy Tumor Diagnoses and Benchmarking of Hematoxylin and Eosin Stained Prostate Core Biopsy Images Generated by Explainable Deep Neural Networks	Aug 2, 2019	Benchmarking, SSIM	Unverified
Agentic AI for Improving Precision in Identifying Contributions to Sustainable Development Goals	Nov 26, 2024	Benchmarking, Retrieval	Unverified
High Fidelity RF Clutter Modeling and Simulation	Feb 10, 2022	Benchmarking, Vocal Bursts Intensity Prediction	Unverified
High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing	Jan 18, 2022	Benchmarking, Feature Engineering	Unverified
Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving	Jan 14, 2025	Autonomous Driving, Benchmarking	Unverified
The EuroCity Persons Dataset: A Novel Benchmark for Object Detection	May 18, 2018	Benchmarking, Object	Unverified
The Evolutionary Computation Methods No One Should Use	Jan 5, 2023	Benchmarking	Unverified
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects	Jul 17, 2024	Benchmarking, Human-Object Interaction Detection	Unverified
Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments	Dec 10, 2024	Benchmarking, object-detection	Unverified
Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation	Jun 7, 2024	Benchmarking	Unverified
Benchmarking Video Frame Interpolation	Mar 25, 2024	Benchmarking, Computational Efficiency	Unverified
SnCQA: A hardware-efficient equivariant quantum convolutional circuit architecture	Nov 23, 2022	Benchmarking, Computational chemistry	Unverified
HLB: Benchmarking LLMs' Humanlikeness in Language Use	Sep 24, 2024	Benchmarking	Unverified
Benchmarking Unsupervised Outlier Detection with Realistic Synthetic Data	Apr 15, 2020	Benchmarking, Outlier Detection	Unverified
The Expressive Power of Word Embeddings	Jan 15, 2013	Benchmarking, Sentence	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified