SOTAVerified

Benchmarking

Papers

Showing 44264450 of 5548 papers

TitleStatusHype
Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines0
Synthetic weather radar using hybrid quantum-classical machine learning0
An implementation of the "Guess who?" game using CLIPCode0
Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking0
HRNET: AI on Edge for mask detection and social distancingCode0
TinyML Platforms Benchmarking0
An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments0
OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images0
3D Compositional Zero-shot Learning with DeCompositional Consensus0
EffCNet: An Efficient CondenseNet for Image Classification on NXP BlueBox0
Benchmarking Shadow Removal for Facial Landmark Detection and Beyond0
Learning to Transfer for Traffic Forecasting via Multi-task LearningCode0
Using Color To Identify Insider ThreatsCode0
A War Beyond Deepfake: Benchmarking Facial Counterfeits and Countermeasures0
A Modular Framework for Centrality and Clustering in Complex Networks0
RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR0
Filter Methods for Feature Selection in Supervised Machine Learning Applications -- Review and Benchmark0
Novel Real-Time EMT-TS Modeling Architecture for Feeder Blackstart Simulations0
CLMB: deep contrastive learning for robust metagenomic binningCode0
Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion Algorithms0
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding0
MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization0
Fantastic Questions and Where to Find Them: FairytaleQA--An Authentic Dataset for Narrative Comprehension0
Mukayese: Turkish NLP Strikes Back0
Multiclass Optimal Classification Trees with SVM-splits0
Show:102550
← PrevPage 178 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified