SOTAVerified

Benchmarking

Papers

Showing 29763000 of 5548 papers

TitleStatusHype
Efficient Benchmarking of Algorithm Configuration Procedures via Model-Based Surrogates0
Efficient Benchmarking of Language Models0
Efficient Benchmarking of NLP APIs using Multi-armed Bandits0
Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack0
Efficient Channel Estimation for Millimeter Wave and Terahertz Systems Enabled by Integrated Super-resolution Sensing and Communication0
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models0
Efficient Expression Neutrality Estimation with Application to Face Recognition Utility Prediction0
Efficiently Exploring Ordering Problems through Conflict-directed Search0
Efficiently Quantifying Individual Agent Importance in Cooperative MARL0
Efficient Processing of Deep Neural Networks: A Tutorial and Survey0
Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification0
EfficientSRFace: An Efficient Network with Super-Resolution Enhancement for Accurate Face Detection0
Efficient Training of Deep Classifiers for Wireless Source Identification using Test SNR Estimates0
Egocentric Human-Object Interaction Detection: A New Benchmark and Method0
EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision0
EGraFFBench: Evaluation of Equivariant Graph Neural Network Force Fields for Atomistic Simulations0
ELKI: A large open-source library for data analysis - ELKI Release 0.7.5 "Heidelberg"0
ELSA: Evaluating Localization of Social Activities in Urban Streets using Open-Vocabulary Detection0
Embarrassingly Simple Scribble Supervision for 3D Medical Segmentation0
Embodied Artificial Intelligence through Distributed Adaptive Control: An Integrated Framework0
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents0
Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool0
Emo3D: Metric and Benchmarking Dataset for 3D Facial Expression Generation from Emotion Description0
EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models0
Emotion Analysis of Tweets Banning Education in Afghanistan0
Show:102550
← PrevPage 120 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified