SOTAVerified

Benchmarking

Papers

Showing 18211830 of 5548 papers

TitleStatusHype
Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated SamplesCode0
MLaKE: Multilingual Knowledge Editing Benchmark for Large Language ModelsCode0
LMEMs for post-hoc analysis of HPO BenchmarkingCode0
InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual IllusionCode0
inMOTIFin: a lightweight end-to-end simulation software for regulatory sequencesCode0
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair PredictionCode0
Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation ThreadsCode0
Benchmark Generation Framework with Customizable Distortions for Image Classifier RobustnessCode0
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part ICode0
Show:102550
← PrevPage 183 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified