SOTAVerified

Benchmarking

Papers

Showing 43514400 of 5548 papers

TitleStatusHype
Hierarchical graph neural nets can capture long-range interactionsCode1
A multi-schematic classifier-independent oversampling approach for imbalanced datasetsCode1
The Benchmark Lottery0
Generative and reproducible benchmarks for comprehensive evaluation of machine learning classifiersCode1
Inverse Contextual Bandits: Learning How Behavior Evolves over TimeCode0
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery0
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity RecognitionCode1
A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data0
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERTCode1
Benchpress: A Scalable and Versatile Workflow for Benchmarking Structure Learning AlgorithmsCode1
Intrinsic uncertainties and where to find them0
The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation and Radiogenomic ClassificationCode1
Connectivity Matters: Neural Network Pruning Through the Lens of Effective SparsityCode0
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement LearningCode1
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents0
Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis0
Modelling Neuronal Behaviour with Time Series Regression: Recurrent Neural Networks on C. Elegans Data0
CityNet: A Comprehensive Multi-Modal Urban Dataset for Advanced Research in Urban ComputingCode0
Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and BenchmarkCode0
On the Interaction of Belief Bias and Explanations0
Benchmarking Knowledge-driven Zero-shot LearningCode1
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human DigitizationCode0
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL0
Kimera-Multi: Robust, Distributed, Dense Metric-Semantic SLAM for Multi-Robot SystemsCode1
Rail-5k: a Real-World Dataset for Rail Surface Defects Detection0
Mitigating severe over-parameterization in deep convolutional neural networks through forced feature abstraction and compression with an entropy-based heuristic0
Benchmarking Differential Privacy and Federated Learning for BERT ModelsCode1
You are AllSet: A Multiset Function Framework for Hypergraph Neural NetworksCode1
PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database0
Mutual-Information Based Few-Shot ClassificationCode1
Synthetic Benchmarks for Scientific Research in Explainable Machine LearningCode1
CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head RedirectionCode0
Underwater Image Restoration via Contrastive Learning and a Real-world DatasetCode1
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic TestingCode1
Learning Graphs for Knowledge Transfer With Limited Labels0
Intrinsic Image HarmonizationCode1
Effective Evaluation of Deep Active Learning on Image Classification Tasks0
A Spiking Neural Network for Image Segmentation0
Understanding and Evaluating Racial Biases in Image CaptioningCode1
A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams0
Hotel Recognition via Latent Image Embedding0
Selection of Source Images Heavily Influences the Effectiveness of Adversarial AttacksCode1
Node Classification Meets Link Prediction on Knowledge Graphs0
On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates0
Online Learning with Optimism and DelayCode1
Cross-replication Reliability -- An Empirical Approach to Interpreting Inter-rater Reliability0
Interpretable machine learning applied to on-farm biosecurity and porcine reproductive and respiratory syndrome virus0
Problem-solving benefits of down-sampled lexicase selection0
Shades of BLEU, Flavours of Success: The Case of MultiWOZCode1
Signals to Spikes for Neuromorphic Regulated Reservoir Computing and EMG Hand Gesture RecognitionCode1
Show:102550
← PrevPage 88 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified