Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2976–3000 of 5548 papers

Title	Date	Tasks	Status
Efficient Benchmarking of Algorithm Configuration Procedures via Model-Based Surrogates	Mar 30, 2017	BenchmarkingHyperparameter Optimization	—Unverified
Efficient Benchmarking of Language Models	Aug 22, 2023	BenchmarkingGPU	—Unverified
Efficient Benchmarking of NLP APIs using Multi-armed Bandits	Apr 1, 2017	BenchmarkingMulti-Armed Bandits	—Unverified
Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack	Mar 18, 2025	8kBenchmarking	—Unverified
Efficient Channel Estimation for Millimeter Wave and Terahertz Systems Enabled by Integrated Super-resolution Sensing and Communication	Jul 30, 2024	BenchmarkingSuper-Resolution	—Unverified
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models	Apr 26, 2024	AttributeBayesian Optimization	—Unverified
Efficient Expression Neutrality Estimation with Application to Face Recognition Utility Prediction	Feb 8, 2024	BenchmarkingFace Image Quality	—Unverified
Efficiently Exploring Ordering Problems through Conflict-directed Search	Apr 15, 2019	BenchmarkingScheduling	—Unverified
Efficiently Quantifying Individual Agent Importance in Cooperative MARL	Dec 13, 2023	BenchmarkingMulti-agent Reinforcement Learning	—Unverified
Efficient Processing of Deep Neural Networks: A Tutorial and Survey	Mar 27, 2017	Benchmarkingspeech-recognition	—Unverified
Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification	Sep 12, 2024	BenchmarkingClassification	—Unverified
EfficientSRFace: An Efficient Network with Super-Resolution Enhancement for Accurate Face Detection	Jun 4, 2023	BenchmarkingFace Detection	—Unverified
Efficient Training of Deep Classifiers for Wireless Source Identification using Test SNR Estimates	Dec 26, 2019	Benchmarking	—Unverified
Egocentric Human-Object Interaction Detection: A New Benchmark and Method	Jun 17, 2025	BenchmarkingHuman-Object Interaction Detection	—Unverified
EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision	Sep 3, 2024	BenchmarkingMixed Reality	—Unverified
EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations	Oct 3, 2023	Atomic ForcesBenchmarking	—Unverified
ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg"	Feb 10, 2019	BenchmarkingClustering	—Unverified
ELSA: Evaluating Localization of Social Activities in Urban Streets using Open-Vocabulary Detection	Jun 3, 2024	Action RecognitionBenchmarking	—Unverified
Embarrassingly Simple Scribble Supervision for 3D Medical Segmentation	Mar 19, 2024	BenchmarkingSegmentation	—Unverified
Embodied Artificial Intelligence through Distributed Adaptive Control: An Integrated Framework	Apr 5, 2017	BenchmarkingBoard Games	—Unverified
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents	Feb 13, 2025	Benchmarking	—Unverified
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool	Sep 16, 2023	BenchmarkingImage Super-Resolution	—Unverified
Emo3D: Metric and Benchmarking Dataset for 3D Facial Expression Generation from Emotion Description	Oct 2, 2024	BenchmarkingFacial expression generation	—Unverified
EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models	Feb 6, 2025	BenchmarkingEmotional Intelligence	—Unverified
Emotion Analysis of Tweets Banning Education in Afghanistan	Jun 28, 2023	BenchmarkingEmotion Classification	—Unverified

Show:10 25 50

← PrevPage 120 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified