SOTAVerified

Benchmarking

Papers

Showing 25312540 of 5548 papers

TitleStatusHype
From MNIST to ImageNet and Back: Benchmarking Continual Curriculum LearningCode0
MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context UnderstandingCode0
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question AnsweringCode0
FR-MRInet: A Deep Convolutional Encoder-Decoder for Brain Tumor Segmentation with Relu-RGB and Sliding-windowCode0
From Past to Present: A Survey of Malicious URL Detection Techniques, Datasets and Code RepositoriesCode0
Fast Benchmarking of Accuracy vs. Training Time with Cyclic Learning RatesCode0
Arabic Speech Recognition by End-to-End, Modular Systems and HumanCode0
Detecting Stereotypes and Anti-stereotypes the Correct Way Using Social Psychological UnderpinningsCode0
Recognizing Object Affordances to Support Scene Reasoning for Manipulation TasksCode0
Forecasting time series with constraintsCode0
Show:102550
← PrevPage 254 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified