SOTAVerified

Benchmarking

Papers

Showing 18511875 of 5548 papers

TitleStatusHype
An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets0
Chart-to-Experience: Benchmarking Multimodal LLMs for Predicting Experiential Impact of Charts0
Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination0
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets0
DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding0
ECG-Adv-GAN: Detecting ECG Adversarial Examples with Conditional Generative Adversarial Networks0
CHaRNet: Conditioned Heatmap Regression for Robust Dental Landmark Localization0
Characterizing Transactional Databases for Frequent Itemset Mining0
Benchmarking and Validation of Sub-mW 30GHz VG-LNAs in 22nm FDSOI CMOS for 5G/6G Phased-Array Receivers0
Benchmarking Deep Learning Architectures for Urban Vegetation Point Cloud Semantic Segmentation from MLS0
Context-guided Triple Matching for Multiple Choice Question Answering0
Context-guided Triple Matching for Multiple Choice Question Answering0
Characterizing the adversarial vulnerability of speech self-supervised learning0
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking0
Exploring the Practicality of Generative Retrieval on Dynamic Corpora0
Continuous Function Structured in Multilayer Perceptron for Global Optimization0
Benchmarking Deep Learning Frameworks for Automated Diagnosis of Ocular Toxoplasmosis: A Comprehensive Approach to Classification and Segmentation0
Continuous-Time Gaussian Process Motion-Compensation for Event-vision Pattern Tracking with Distance Fields0
Characterizing Missing Information in Deep Networks Using Backpropagated Gradients0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation0
An Empirical Investigation into Benchmarking Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification0
Characterization of Multiple 3D LiDARs for Localization and Mapping using Normal Distributions Transform0
Characterization of Constrained Continuous Multiobjective Optimization Problems: A Performance Space Perspective0
Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices0
Dynabench: Rethinking Benchmarking in NLP0
Show:102550
← PrevPage 75 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified