SOTAVerified

Benchmarking

Papers

Showing 20012025 of 5548 papers

TitleStatusHype
BeSt-LeS: Benchmarking Stroke Lesion Segmentation using Deep SupervisionCode0
Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learningCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
Action-conditioned Benchmarking of Robotic Video Prediction Models: a Comparative StudyCode0
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
Impact of ImageNet Model Selection on Domain AdaptationCode0
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
A Meta-Analysis of the Anomaly Detection ProblemCode0
Benchmarks for Graph Embedding EvaluationCode0
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis DatasetCode0
Back to Basics: Benchmarking Canonical Evolution Strategies for Playing AtariCode0
Benchmark of Deep Learning Models on Large Healthcare MIMIC DatasetsCode0
AlphaZip: Neural Network-Enhanced Lossless Text CompressionCode0
Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot StudyCode0
Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual IllusionsCode0
A Wild Bootstrap for Degenerate Kernel TestsCode0
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applicationsCode0
IJCB 2022 Mobile Behavioral Biometrics Competition (MobileB2C)Code0
Illuminating the Diversity-Fitness Trade-Off in Black-Box OptimizationCode0
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity LearningCode0
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian ContextCode0
Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary DropoutsCode0
Benchmarking White Blood Cell Classification Under Domain ShiftCode0
Identifying the Smallest Adversarial Load Perturbations that Render DC-OPF InfeasibleCode0
Identifying and Benchmarking Natural Out-of-Context Prediction ProblemsCode0
Show:102550
← PrevPage 81 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified