SOTAVerified

Benchmarking

Papers

Showing 41264150 of 5548 papers

TitleStatusHype
An implementation of the "Guess who?" game using CLIPCode0
Synthetic weather radar using hybrid quantum-classical machine learning0
Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking0
HRNET: AI on Edge for mask detection and social distancingCode0
3D Compositional Zero-shot Learning with DeCompositional Consensus0
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate ModelsCode1
OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images0
An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments0
EffCNet: An Efficient CondenseNet for Image Classification on NXP BlueBox0
Learning to Transfer for Traffic Forecasting via Multi-task LearningCode0
Benchmarking Shadow Removal for Facial Landmark Detection and Beyond0
Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical SpacesCode1
A War Beyond Deepfake: Benchmarking Facial Counterfeits and Countermeasures0
Using Color To Identify Insider ThreatsCode0
Investigating Tradeoffs in Real-World Video Super-ResolutionCode2
EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture SearchCode1
RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR0
A Modular Framework for Centrality and Clustering in Complex Networks0
Filter Methods for Feature Selection in Supervised Machine Learning Applications -- Review and Benchmark0
Evaluating Adversarial Attacks on ImageNet: A Reality Check on Misclassification ClassesCode1
Benchmarking Detection Transfer Learning with Vision TransformersCode1
FedCV: A Federated Learning Framework for Diverse Computer Vision TasksCode1
Benchmarking emergency department triage prediction models with machine learning and large public electronic health recordsCode1
GRecX: An Efficient and Unified Benchmark for GNN-based RecommendationCode1
Novel Real-Time EMT-TS Modeling Architecture for Feeder Blackstart Simulations0
Show:102550
← PrevPage 166 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified