SOTAVerified

Benchmarking

Papers

Showing 2876-2900 of 5548 papers

Title | Status | Hype
Designing labeled graph classifiers by exploiting the Rényi entropy of the dissimilarity representation | - | 0
Design of intelligent proofreading system for English translation based on CNN and BERT | - | 0
Design of Supervision-Scalable Learning Systems: Methodology and Performance Benchmarking | - | 0
Design Target Achievement Index: A Differentiable Metric to Enhance Deep Generative Models in Multi-Objective Inverse Design | - | 0
Detecting Finger-Vein Presentation Attacks Using 3D Shape & Diffuse Reflectance Decomposition | - | 0
Detecting Out-Of-Distribution Samples Using Low-Order Deep Features Statistics | - | 0
Detection and Evaluation of Clusters within Sequential Data | - | 0
Detection of Adversarial Attacks and Characterization of Adversarial Subspace | - | 0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry | - | 0
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection | - | 0
detrex: Benchmarking Detection Transformers | - | 0
Development details and computational benchmarking of DEPAM | - | 0
Device Modeling Bias in ReRAM-based Neural Network Simulations | - | 0
DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes | - | 0
DFTR: Depth-supervised Fusion Transformer for Salient Object Detection | - | 0
DHP Benchmark: Are LLMs Good NLG Evaluators? | - | 0
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking | - | 0
Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy | - | 0
DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale | - | 0
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs | - | 0
Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset | - | 0
DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior | - | 0
Different Horses for Different Courses: Comparing Bias Mitigation Algorithms in ML | - | 0
Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation | - | 0
Diffusion-Driven Domain Adaptation for Generating 3D Molecules | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified