SOTAVerified

Benchmarking

Papers

Showing 28512875 of 5548 papers

TitleStatusHype
The Design and Implementation of a Scalable DL Benchmarking Platform0
Handwritten Text Recognition: A Survey0
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset0
xai_evals : A Framework for Evaluating Post-Hoc Local Explanation Methods0
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead0
Hardware-aware mobile building block evaluation for computer vision0
The Disagreement Problem in Faithfulness Metrics0
The DLV System for Knowledge Representation and Reasoning0
Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study0
The Dota 2 Bot Competition0
Benchmarking XAI Explanations with Human-Aligned Evaluations0
A Baseline Method for Removing Invisible Image Watermarks using Deep Image Prior0
Benchmarking with MIMIC-IV, an irregular, spare clinical time series dataset0
HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard0
Hawk: An Industrial-strength Multi-label Document Classifier0
Benchmarking Waitlist Mortality Prediction in Heart Transplantation Through Time-to-Event Modeling using New Longitudinal UNOS Dataset0
Benchmarking VLMs' Reasoning About Persuasive Atypical Images0
Haze Visibility Enhancement: A Survey and Quantitative Benchmarking0
Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information0
Heidelberg Colorectal Data Set for Surgical Data Science in the Sensor Operating Room0
HelixDesign-Binder: A Scalable Production-Grade Platform for Binder Design Built on HelixFold30
Helsinki Deblur Challenge 2021: description of photographic data0
HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding0
Agent-oriented Joint Decision Support for Data Owners in Auction-based Federated Learning0
Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression0
Show:102550
← PrevPage 115 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified