Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4551–4600 of 5548 papers

Title	Date	Tasks	Status
Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study	Aug 5, 2021	BenchmarkingDeep Reinforcement Learning	—Unverified
Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method	Aug 5, 2021	BenchmarkingDecoder	—Unverified
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney	Aug 4, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
Comparative Analysis of Packages and Algorithms for the Analysis of Spatially Resolved Transcriptomics Data	Aug 3, 2021	Benchmarking	—Unverified
The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation	Aug 1, 2021	BenchmarkingMachine Translation	—Unverified
Improving Model Generalization: A Chinese Named Entity Recognition Case Study	Aug 1, 2021	BenchmarkingChinese Named Entity Recognition	—Unverified
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts	Aug 1, 2021	BenchmarkingBinary Classification	—Unverified
Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference	Aug 1, 2021	BenchmarkingClustering	CodeCode Available
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes	Aug 1, 2021	BenchmarkingBinary Classification	—Unverified
Multilingual Protest News Detection - Shared Task 1, CASE 2021	Aug 1, 2021	BenchmarkingDecision Making	—Unverified
Benchmarking Neural Topic Models: An Empirical Study	Aug 1, 2021	BenchmarkingTopic Models	—Unverified
Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability	Aug 1, 2021	Benchmarking	—Unverified
SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale Artificial Life Applications	Aug 1, 2021	Artificial LifeBenchmarking	CodeCode Available
Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model	Jul 27, 2021	Benchmarking	—Unverified
AA3DNet: Attention Augmented Real Time 3D Object Detection	Jul 26, 2021	3D Object DetectionAutonomous Vehicles	—Unverified
Benchmarking AutoML Frameworks for Disease Prediction Using Medical Claims	Jul 22, 2021	AutoMLBenchmarking	—Unverified
3D fluorescence microscopy data synthesis for segmentation and benchmarking	Jul 21, 2021	Benchmarking	CodeCode Available
An Exploration of Exploration: Measuring the ability of lexicase selection to find obscure pathways to optimality	Jul 20, 2021	BenchmarkingDiagnostic	CodeCode Available
PhD Thesis on Code Modulated Interferometric Imaging System using Phased Arrays	Jul 19, 2021	Benchmarking	—Unverified
Attribution of Predictive Uncertainties in Classification Models	Jul 19, 2021	BenchmarkingClassification	CodeCode Available
Learned Sorted Table Search and Static Indexes in Small Model Space	Jul 19, 2021	BenchmarkingOpen-Ended Question Answering	CodeCode Available
Better force fields start with better data -- A data set of cation dipeptide interactions	Jul 19, 2021	Benchmarking	CodeCode Available
ECG-Adv-GAN: Detecting ECG Adversarial Examples with Conditional Generative Adversarial Networks	Jul 16, 2021	BenchmarkingGenerative Adversarial Network	—Unverified
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi	Jul 15, 2021	BenchmarkingDeep Reinforcement Learning	—Unverified
The Benchmark Lottery	Jul 14, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
Inverse Contextual Bandits: Learning How Behavior Evolves over Time	Jul 13, 2021	BenchmarkingDecision Making	CodeCode Available
R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery	Jul 12, 2021	BenchmarkingDeep Reinforcement Learning	—Unverified
A Framework and Benchmarking Study for Counterfactual Generating Methods on Tabular Data	Jul 9, 2021	Benchmarkingcounterfactual	—Unverified
Intrinsic uncertainties and where to find them	Jul 6, 2021	Benchmarking	—Unverified
Connectivity Matters: Neural Network Pruning Through the Lens of Effective Sparsity	Jul 5, 2021	BenchmarkingNetwork Pruning	CodeCode Available
SocialAI: Benchmarking Socio-Cognitive Abilities in Deep Reinforcement Learning Agents	Jul 2, 2021	BenchmarkingDeep Reinforcement Learning	—Unverified
Modelling Neuronal Behaviour with Time Series Regression: Recurrent Neural Networks on C. Elegans Data	Jul 1, 2021	Benchmarkingregression	—Unverified
Benchmarking ASR Systems Based on Post-Editing Effort and Error Analysis	Jul 1, 2021	Benchmarking	—Unverified
CityNet: A Comprehensive Multi-Modal Urban Dataset for Advanced Research in Urban Computing	Jun 30, 2021	BenchmarkingTransfer Learning	CodeCode Available
Exploring Context Generalizability in Citywide Crowd Mobility Prediction: An Analytic Framework and Benchmark	Jun 30, 2021	BenchmarkingPrediction	CodeCode Available
On the Interaction of Belief Bias and Explanations	Jun 29, 2021	Benchmarking	—Unverified
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL	Jun 28, 2021	BenchmarkingObject	—Unverified
Efficient Realistic Data Generation Framework leveraging Deep Learning-based Human Digitization	Jun 28, 2021	BenchmarkingDeep Learning	CodeCode Available
Rail-5k: a Real-World Dataset for Rail Surface Defects Detection	Jun 28, 2021	4kBenchmarking	—Unverified
Mitigating severe over-parameterization in deep convolutional neural networks through forced feature abstraction and compression with an entropy-based heuristic	Jun 27, 2021	BenchmarkingFeature Compression	—Unverified
PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database	Jun 23, 2021	BenchmarkingClustering	—Unverified
CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection	Jun 21, 2021	BenchmarkingDomain Adaptation	CodeCode Available
Learning Graphs for Knowledge Transfer With Limited Labels	Jun 19, 2021	Action RecognitionBenchmarking	—Unverified
A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams	Jun 16, 2021	Active LearningBenchmarking	—Unverified
A Spiking Neural Network for Image Segmentation	Jun 16, 2021	BenchmarkingCPU	—Unverified
Effective Evaluation of Deep Active Learning on Image Classification Tasks	Jun 16, 2021	Active LearningBenchmarking	—Unverified
Hotel Recognition via Latent Image Embedding	Jun 15, 2021	BenchmarkingMetric Learning	—Unverified
Node Classification Meets Link Prediction on Knowledge Graphs	Jun 14, 2021	BenchmarkingClassification	—Unverified
On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates	Jun 13, 2021	BenchmarkingFederated Learning	—Unverified
Cross-replication Reliability -- An Empirical Approach to Interpreting Inter-rater Reliability	Jun 11, 2021	Benchmarking	—Unverified

Show:10 25 50

← PrevPage 92 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified