SOTAVerified

Benchmarking

Papers

Showing 4351-4375 of 5548 papers

Title | Status | Hype
Hierarchical graph neural nets can capture long-range interactions | Code | 1
A multi-schematic classifier-independent oversampling approach for imbalanced datasets | Code | 1
The Benchmark Lottery | | 0
Generative and reproducible benchmarks for comprehensive evaluation of machine learning classifiers | Code | 1
Inverse Contextual Bandits: Learning How Behavior Evolves over Time | Code | 0
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery | | 0
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition | Code | 1
A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data | | 0
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT | Code | 1
Benchpress: A Scalable and Versatile Workflow for Benchmarking Structure Learning Algorithms | Code | 1
Intrinsic uncertainties and where to find them | | 0
The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation and Radiogenomic Classification | Code | 1
Connectivity Matters: Neural Network Pruning Through the Lens of Effective Sparsity | Code | 0
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning | Code | 1
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents | | 0
Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis | | 0
Modelling Neuronal Behaviour with Time Series Regression: Recurrent Neural Networks on C. Elegans Data | | 0
CityNet: A Comprehensive Multi-Modal Urban Dataset for Advanced Research in Urban Computing | Code | 0
Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and Benchmark | Code | 0
On the Interaction of Belief Bias and Explanations | | 0
Benchmarking Knowledge-driven Zero-shot Learning | Code | 1
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization | Code | 0
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL | | 0
Kimera-Multi: Robust, Distributed, Dense Metric-Semantic SLAM for Multi-Robot Systems | Code | 1
Rail-5k: a Real-World Dataset for Rail Surface Defects Detection | | 0
Page 175 of 222

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified