SOTAVerified

Benchmarking

Papers

Showing 26012625 of 5548 papers

TitleStatusHype
Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand HygieneCode0
FR-MRInet: A Deep Convolutional Encoder-Decoder for Brain Tumor Segmentation with Relu-RGB and Sliding-windowCode0
From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language RepresentationCode0
Aesthetic Image Captioning From Weakly-Labelled PhotographsCode0
Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation DifficultyCode0
DefAn: Definitive Answer Dataset for LLMs Hallucination EvaluationCode0
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question AnsweringCode0
From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in HistopathologyCode0
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word ProblemCode0
Benchmarking Graph Representations and Graph Neural Networks for Multivariate Time Series ClassificationCode0
A projected nonlinear state-space model for forecasting time series signalsCode0
First-frame Supervised Video Polyp Segmentation via Propagative and Semantic Dual-teacher NetworkCode0
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN ParametersCode0
Deep Reinforcement Learning for General Video Game AICode0
Forecasting Future International Events: A Reliable Dataset for Text-Based Event ModelingCode0
Forecasting time series with constraintsCode0
FlexMol: A Flexible Toolkit for Benchmarking Molecular Relational LearningCode0
Benchmarking Robust Self-Supervised Learning Across Diverse Downstream TasksCode0
2017 Robotic Instrument Segmentation ChallengeCode0
fMRI-S4: learning short- and long-range dynamic fMRI dependencies using 1D Convolutions and State Space ModelsCode0
DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing UnderstandingCode0
A predictive analytics approach for stroke prediction using machine learning and neural networksCode0
Fluorescence Reference Target Quantitative Analysis LibraryCode0
DeepOBS: A Deep Learning Optimizer Benchmark SuiteCode0
Deep Neural Network Benchmarks for Selective ClassificationCode0
Show:102550
← PrevPage 105 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified