SOTAVerified

Benchmarking

Papers

Showing 43014350 of 5548 papers

TitleStatusHype
EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data0
Benchmarking Active Learning Strategies for Materials Optimization and Discovery0
From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in HistopathologyCode0
Metaethical Perspectives on 'Benchmarking' AI Ethics0
Benchmarking for Public Health Surveillance tasks on Social Media with a Domain-Specific Pretrained Language Model0
Disability prediction in multiple sclerosis using performance outcome measures and demographic data0
tmVar 3.0: an improved variant concept recognition and normalization tool0
CLEAVE: Scalable and Edge-native Benchmarking of Networked Control SystemsCode0
A lightweight and accurate YOLO-like network for small target detection in Aerial Imagery0
A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality0
Efficient, Uncertainty-based Moderation of Neural Networks Text ClassifiersCode0
pmuBAGE: The Benchmarking Assortment of Generated PMU Data for Power System Events -- Part I: Overview and ResultsCode0
Intelligence at the Extreme Edge: A Survey on Reformable TinyML0
Unitail: Detecting, Reading, and Matching in Retail Scene0
Assessing the risk of re-identification arising from an attack on anonymised data0
Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?0
To Find Waldo You Need Contextual Cues: Debiasing Who's WaldoCode0
Treatment Learning Causal Transformer for Noisy Image Classification0
A Unified Study of Machine Learning Explanation Evaluation Metrics0
Benchmarking Deep AUROC Optimization: Loss Functions and Algorithmic Choices0
Benchmarking Algorithms for Automatic License Plate Recognition0
LAMBDA: Covering the Solution Set of Black-Box Inequality by Search Space Quantization0
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition0
An Optical Control Environment for Benchmarking Reinforcement Learning AlgorithmsCode0
A Perspective on Neural Capacity Estimation: Viability and Reliability0
Benchmarking Test-Time Unsupervised Deep Neural Network Adaptation on Edge Devices0
Policy Gradients using Variational Quantum Circuits0
Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes ProsthesisCode0
A Statistical Framework to Investigate the Optimality of Signal-Reconstruction Methods0
On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of ClassifiersCode0
Fiber Bundle Morphisms as a Framework for Modeling Many-to-Many Maps0
From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction0
ALDI++: Automatic and parameter-less discord and outlier detection for building energy load profilesCode0
DFTR: Depth-supervised Fusion Transformer for Salient Object Detection0
A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach0
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages0
Metastatic Cancer Outcome Prediction with Injective Multiple Instance Pooling0
Mapping global dynamics of benchmark creation and saturation in artificial intelligence0
Score-Based Generative Models for Molecule Generation0
Systematic Comparison of Path Planning Algorithms using PathBench0
Multi-channel deep convolutional neural networks for multi-classifying thyroid disease0
Automated Machine Learning: A Case Study on Non-Intrusive Appliance Load Monitoring0
Benchmarking real-time algorithms for in-phase auditory stimulation of low amplitude slow waves with wearable EEG devices during sleep0
Graph clustering with Boltzmann machines0
Towards Benchmarking and Evaluating Deepfake Detection0
Benchmarking Instance-Centric Counterfactual Algorithms for XAI: From White Box to Black BoxCode0
KamNet: An Integrated Spatiotemporal Deep Neural Network for Rare Event Search in KamLAND-ZenCode0
Reliable validation of Reinforcement Learning Benchmarks0
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation0
Adaptive Gradient Methods with Local Guarantees0
Show:102550
← PrevPage 87 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified