Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4351–4400 of 5548 papers

Title	Date	Tasks	Status	Hype
Hierarchical graph neural nets can capture long-range interactions	Jul 15, 2021	BenchmarkingMolecular Property Prediction	CodeCode Available	1
A multi-schematic classifier-independent oversampling approach for imbalanced datasets	Jul 15, 2021	Benchmarking	CodeCode Available	1
The Benchmark Lottery	Jul 14, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Generative and reproducible benchmarks for comprehensive evaluation of machine learning classifiers	Jul 14, 2021	BenchmarkingBIG-bench Machine Learning	CodeCode Available	1
Inverse Contextual Bandits: Learning How Behavior Evolves over Time	Jul 13, 2021	BenchmarkingDecision Making	CodeCode Available	0
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery	Jul 12, 2021	BenchmarkingDeep Reinforcement Learning	—Unverified	0
MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition	Jul 12, 2021	BenchmarkingChinese Named Entity Recognition	CodeCode Available	1
A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data	Jul 9, 2021	Benchmarkingcounterfactual	—Unverified	0
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT	Jul 9, 2021	BenchmarkingDocument Classification	CodeCode Available	1
Benchpress: A Scalable and Versatile Workflow for Benchmarking Structure Learning Algorithms	Jul 8, 2021	Benchmarking	CodeCode Available	1
Intrinsic uncertainties and where to find them	Jul 6, 2021	Benchmarking	—Unverified	0
The RSNA-ASNR-MICCAI BraTS 2021 Benchmark on Brain Tumor Segmentation and Radiogenomic Classification	Jul 5, 2021	BenchmarkingBrain Tumor Segmentation	CodeCode Available	1
Connectivity Matters: Neural Network Pruning Through the Lens of Effective Sparsity	Jul 5, 2021	BenchmarkingNetwork Pruning	CodeCode Available	0
Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning	Jul 2, 2021	BenchmarkingCausal Discovery	CodeCode Available	1
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents	Jul 2, 2021	BenchmarkingDeep Reinforcement Learning	—Unverified	0
Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis	Jul 1, 2021	Benchmarking	—Unverified	0
Modelling Neuronal Behaviour with Time Series Regression: Recurrent Neural Networks on C. Elegans Data	Jul 1, 2021	Benchmarkingregression	—Unverified	0
CityNet: A Comprehensive Multi-Modal Urban Dataset for Advanced Research in Urban Computing	Jun 30, 2021	BenchmarkingTransfer Learning	CodeCode Available	0
Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and Benchmark	Jun 30, 2021	BenchmarkingPrediction	CodeCode Available	0
On the Interaction of Belief Bias and Explanations	Jun 29, 2021	Benchmarking	—Unverified	0
Benchmarking Knowledge-driven Zero-shot Learning	Jun 29, 2021	AttributeBenchmarking	CodeCode Available	1
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization	Jun 28, 2021	BenchmarkingDeep Learning	CodeCode Available	0
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL	Jun 28, 2021	BenchmarkingObject	—Unverified	0
Kimera-Multi: Robust, Distributed, Dense Metric-Semantic SLAM for Multi-Robot Systems	Jun 28, 2021	3D ReconstructionBenchmarking	CodeCode Available	1
Rail-5k: a Real-World Dataset for Rail Surface Defects Detection	Jun 28, 2021	4kBenchmarking	—Unverified	0
Mitigating severe over-parameterization in deep convolutional neural networks through forced feature abstraction and compression with an entropy-based heuristic	Jun 27, 2021	BenchmarkingFeature Compression	—Unverified	0
Benchmarking Differential Privacy and Federated Learning for BERT Models	Jun 26, 2021	BenchmarkingFederated Learning	CodeCode Available	1
You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks	Jun 24, 2021	BenchmarkingNode Classification	CodeCode Available	1
PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database	Jun 23, 2021	BenchmarkingClustering	—Unverified	0
Mutual-Information Based Few-Shot Classification	Jun 23, 2021	BenchmarkingClassification	CodeCode Available	1
Synthetic Benchmarks for Scientific Research in Explainable Machine Learning	Jun 23, 2021	BenchmarkingBIG-bench Machine Learning	CodeCode Available	1
CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection	Jun 21, 2021	BenchmarkingDomain Adaptation	CodeCode Available	0
Underwater Image Restoration via Contrastive Learning and a Real-world Dataset	Jun 20, 2021	BenchmarkingContrastive Learning	CodeCode Available	1
Perception Matters: Detecting Perception Failures of VQA Models Using Metamorphic Testing	Jun 19, 2021	BenchmarkingDNN Testing	CodeCode Available	1
Learning Graphs for Knowledge Transfer With Limited Labels	Jun 19, 2021	Action RecognitionBenchmarking	—Unverified	0
Intrinsic Image Harmonization	Jun 19, 2021	BenchmarkingImage Harmonization	CodeCode Available	1
Effective Evaluation of Deep Active Learning on Image Classification Tasks	Jun 16, 2021	Active LearningBenchmarking	—Unverified	0
A Spiking Neural Network for Image Segmentation	Jun 16, 2021	BenchmarkingCPU	—Unverified	0
Understanding and Evaluating Racial Biases in Image Captioning	Jun 16, 2021	BenchmarkingImage Captioning	CodeCode Available	1
A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams	Jun 16, 2021	Active LearningBenchmarking	—Unverified	0
Hotel Recognition via Latent Image Embedding	Jun 15, 2021	BenchmarkingMetric Learning	—Unverified	0
Selection of Source Images Heavily Influences the Effectiveness of Adversarial Attacks	Jun 14, 2021	Benchmarking	CodeCode Available	1
Node Classification Meets Link Prediction on Knowledge Graphs	Jun 14, 2021	BenchmarkingClassification	—Unverified	0
On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates	Jun 13, 2021	BenchmarkingFederated Learning	—Unverified	0
Online Learning with Optimism and Delay	Jun 13, 2021	BenchmarkingWeather Forecasting	CodeCode Available	1
Cross-replication Reliability -- An Empirical Approach to Interpreting Inter-rater Reliability	Jun 11, 2021	Benchmarking	—Unverified	0
Interpretable machine learning applied to on-farm biosecurity and porcine reproductive and respiratory syndrome virus	Jun 11, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Problem-solving benefits of down-sampled lexicase selection	Jun 10, 2021	Benchmarking	—Unverified	0
Shades of BLEU, Flavours of Success: The Case of MultiWOZ	Jun 10, 2021	BenchmarkingTask-Oriented Dialogue Systems	CodeCode Available	1
Signals to Spikes for Neuromorphic Regulated Reservoir Computing and EMG Hand Gesture Recognition	Jun 9, 2021	BenchmarkingEMG Gesture Recognition	CodeCode Available	1

Show:10 25 50

← PrevPage 88 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified