SOTAVerified

Benchmarking

Papers

Showing 1861-1870 of 5548 papers

Title | Status | Hype
Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated Samples | Code | 0
Multimodal Multi-User Surface Recognition with the Kernel Two-Sample Test | Code | 0
InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual Illusion | Code | 0
inMOTIFin: a lightweight end-to-end simulation software for regulatory sequences | Code | 0
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation | Code | 0
MineRL: A Large-Scale Dataset of Minecraft Demonstrations | Code | 0
OpenDMC: An Open-Source Library and Performance Evaluation for Deep-learning-based Multi-frame Compression | Code | 0
Advancing and Benchmarking Personalized Tool Invocation for LLMs | Code | 0
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge | Code | 0
Impact of ImageNet Model Selection on Domain Adaptation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified