Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4301–4350 of 5548 papers

Title	Date	Tasks	Status
EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data	Apr 12, 2022	BenchmarkingSegmentation	—Unverified
Benchmarking Active Learning Strategies for Materials Optimization and Discovery	Apr 12, 2022	Active LearningBenchmarking	—Unverified
From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in Histopathology	Apr 11, 2022	BenchmarkingCancer Classification	CodeCode Available
Metaethical Perspectives on 'Benchmarking' AI Ethics	Apr 11, 2022	BenchmarkingEthics	—Unverified
Benchmarking for Public Health Surveillance tasks on Social Media with a Domain-Specific Pretrained Language Model	Apr 9, 2022	BenchmarkingLanguage Modeling	—Unverified
Disability prediction in multiple sclerosis using performance outcome measures and demographic data	Apr 8, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified
tmVar 3.0: an improved variant concept recognition and normalization tool	Apr 7, 2022	Benchmarking	—Unverified
CLEAVE: Scalable and Edge-native Benchmarking of Networked Control Systems	Apr 5, 2022	BenchmarkingEdge-computing	CodeCode Available
A lightweight and accurate YOLO-like network for small target detection in Aerial Imagery	Apr 5, 2022	Benchmarkingobject-detection	—Unverified
A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality	Apr 5, 2022	BenchmarkingSelf-Supervised Learning	—Unverified
Efficient, Uncertainty-based Moderation of Neural Networks Text Classifiers	Apr 4, 2022	Benchmarking	CodeCode Available
pmuBAGE: The Benchmarking Assortment of Generated PMU Data for Power System Events -- Part I: Overview and Results	Apr 3, 2022	Benchmarking	CodeCode Available
Intelligence at the Extreme Edge: A Survey on Reformable TinyML	Apr 2, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified
Unitail: Detecting, Reading, and Matching in Retail Scene	Apr 1, 2022	BenchmarkingDense Object Detection	—Unverified
Assessing the risk of re-identification arising from an attack on anonymised data	Mar 31, 2022	Benchmarking	—Unverified
Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?	Mar 30, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo	Mar 30, 2022	BenchmarkingPerson-centric Visual Grounding	CodeCode Available
Treatment Learning Causal Transformer for Noisy Image Classification	Mar 29, 2022	BenchmarkingClassification	—Unverified
A Unified Study of Machine Learning Explanation Evaluation Metrics	Mar 27, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified
Benchmarking Deep AUROC Optimization: Loss Functions and Algorithmic Choices	Mar 27, 2022	Benchmarkingimbalanced classification	—Unverified
Benchmarking Algorithms for Automatic License Plate Recognition	Mar 27, 2022	BenchmarkingLicense Plate Recognition	—Unverified
LAMBDA: Covering the Solution Set of Black-Box Inequality by Search Space Quantization	Mar 25, 2022	BenchmarkingQuantization	—Unverified
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition	Mar 23, 2022	BenchmarkingScene Text Detection	—Unverified
An Optical Control Environment for Benchmarking Reinforcement Learning Algorithms	Mar 23, 2022	BenchmarkingDeep Reinforcement Learning	CodeCode Available
A Perspective on Neural Capacity Estimation: Viability and Reliability	Mar 22, 2022	BenchmarkingCapacity Estimation	—Unverified
Benchmarking Test-Time Unsupervised Deep Neural Network Adaptation on Edge Devices	Mar 21, 2022	BenchmarkingGPU	—Unverified
Policy Gradients using Variational Quantum Circuits	Mar 20, 2022	BenchmarkingQuantum Machine Learning	—Unverified
Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes Prosthesis	Mar 18, 2022	BenchmarkingObject Recognition	CodeCode Available
A Statistical Framework to Investigate the Optimality of Signal-Reconstruction Methods	Mar 18, 2022	Benchmarking	—Unverified
On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of Classifiers	Mar 16, 2022	Benchmarking	CodeCode Available
Fiber Bundle Morphisms as a Framework for Modeling Many-to-Many Maps	Mar 15, 2022	BenchmarkingSentiment Analysis	—Unverified
From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction	Mar 15, 2022	3D geometryBenchmarking	—Unverified
ALDI++: Automatic and parameter-less discord and outlier detection for building energy load profiles	Mar 13, 2022	BenchmarkingBIG-bench Machine Learning	CodeCode Available
DFTR: Depth-supervised Fusion Transformer for Salient Object Detection	Mar 12, 2022	BenchmarkingObject	—Unverified
A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach	Mar 10, 2022	BenchmarkingSentence	—Unverified
IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages	Mar 10, 2022	ArticlesBenchmarking	—Unverified
Metastatic Cancer Outcome Prediction with Injective Multiple Instance Pooling	Mar 9, 2022	BenchmarkingManagement	—Unverified
Mapping global dynamics of benchmark creation and saturation in artificial intelligence	Mar 9, 2022	Benchmarking	—Unverified
Score-Based Generative Models for Molecule Generation	Mar 7, 2022	Benchmarking	—Unverified
Systematic Comparison of Path Planning Algorithms using PathBench	Mar 7, 2022	Benchmarking	—Unverified
Multi-channel deep convolutional neural networks for multi-classifying thyroid disease	Mar 6, 2022	BenchmarkingBinary Classification	—Unverified
Automated Machine Learning: A Case Study on Non-Intrusive Appliance Load Monitoring	Mar 6, 2022	AutoMLBayesian Optimization	—Unverified
Benchmarking real-time algorithms for in-phase auditory stimulation of low amplitude slow waves with wearable EEG devices during sleep	Mar 4, 2022	BenchmarkingComputational Efficiency	—Unverified
Graph clustering with Boltzmann machines	Mar 4, 2022	BenchmarkingClustering	—Unverified
Towards Benchmarking and Evaluating Deepfake Detection	Mar 4, 2022	BenchmarkingDeepFake Detection	—Unverified
Benchmarking Instance-Centric Counterfactual Algorithms for XAI: From White Box to Black Box	Mar 4, 2022	Benchmarkingcounterfactual	CodeCode Available
KamNet: An Integrated Spatiotemporal Deep Neural Network for Rare Event Search in KamLAND-Zen	Mar 3, 2022	Benchmarking	CodeCode Available
Reliable validation of Reinforcement Learning Benchmarks	Mar 2, 2022	BenchmarkingData Compression	—Unverified
Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation	Mar 2, 2022	BenchmarkingDeep Learning	—Unverified
Adaptive Gradient Methods with Local Guarantees	Mar 2, 2022	Benchmarking	—Unverified

Show:10 25 50

← PrevPage 87 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified