Benchmarking

Papers

Showing 4301–4350 of 5548 papers

Title	Date	Tasks	Status	Hype
Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution	Aug 29, 2021	Benchmarking	Code Available	1
Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training Debiasing	Aug 27, 2021	Benchmarking, Data Augmentation	Code Available	1
Benchmarking high-fidelity pedestrian tracking systems for research, real-time monitoring and crowd control	Aug 26, 2021	Benchmarking, Density Estimation	Unverified	0
Technological Approaches to Detecting Online Disinformation and Manipulation	Aug 26, 2021	Benchmarking, Fact Checking	Unverified	0
A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games	Aug 25, 2021	Benchmarking, Video Classification	Code Available	1
A Benchmark for Spray from Nearby Cutting Vehicles	Aug 24, 2021	Autonomous Driving, Benchmarking	Unverified	0
Evolving Evolutionary Algorithms using Linear Genetic Programming	Aug 21, 2021	Benchmarking, Evolutionary Algorithms	Unverified	0
DeepEdgeBench: Benchmarking Deep Neural Networks on Edge Devices	Aug 21, 2021	Benchmarking, Edge-computing	Unverified	0
AutoLay: Benchmarking amodal layout estimation for autonomous driving	Aug 20, 2021	Amodal Layout Estimation, Autonomous Driving	Unverified	0
Generative Wind Power Curve Modeling Via Machine Vision: A Self-learning Deep Convolutional Network Based Method	Aug 19, 2021	Benchmarking, Synthetic Data Generation	Code Available	1
Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks	Aug 19, 2021	Benchmarking, Classification	Unverified	0
Discriminating modelling approaches for Point in Time Economic Scenario Generation	Aug 19, 2021	Benchmarking	Unverified	0
SSH: A Self-Supervised Framework for Image Harmonization	Aug 15, 2021	Benchmarking, Data Augmentation	Code Available	1
SIAM: Chiplet-based Scalable In-Memory Acceleration with Mesh for Deep Neural Networks	Aug 14, 2021	Benchmarking	Unverified	0
A Dataset for Answering Time-Sensitive Questions	Aug 13, 2021	Benchmarking	Code Available	1
A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis	Aug 12, 2021	Benchmarking, Medical Image Analysis	Code Available	1
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate	Aug 12, 2021	Benchmarking	Code Available	1
Distributional Depth-Based Estimation of Object Articulation Models	Aug 12, 2021	Benchmarking, Object	Code Available	0
BenchENAS: A Benchmarking Platform for Evolutionary Neural Architecture Search	Aug 9, 2021	Benchmarking, GPU	Code Available	0
A Look at the Evaluation Setup of the M5 Forecasting Competition	Aug 8, 2021	Benchmarking, Decision Making	Unverified	0
Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption	Aug 7, 2021	Benchmarking, Federated Learning	Unverified	0
Intelligent Railway Foreign Object Detection: A Semi-supervised Convolutional Autoencoder Based Method	Aug 5, 2021	Benchmarking, Decoder	Unverified	0
Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach	Aug 5, 2021	Benchmarking	Code Available	1
Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study	Aug 5, 2021	Benchmarking, Deep Reinforcement Learning	Unverified	0
Terabyte-scale supervised 3D training and benchmarking dataset of the mouse kidney	Aug 4, 2021	Benchmarking, BIG-bench Machine Learning	Unverified	0
Comparative Analysis of Packages and Algorithms for the Analysis of Spatially Resolved Transcriptomics Data	Aug 3, 2021	Benchmarking	Unverified	0
Quantum machine learning of large datasets using randomized measurements	Aug 2, 2021	Benchmarking, BIG-bench Machine Learning	Code Available	1
CARLA: A Python Library to Benchmark Algorithmic Recourse and Counterfactual Explanation Algorithms	Aug 2, 2021	Benchmarking, counterfactual	Code Available	1
Multilingual Protest News Detection - Shared Task 1, CASE 2021	Aug 1, 2021	Benchmarking, Decision Making	Unverified	0
Benchmarking: Past, Present and Future	Aug 1, 2021	Benchmarking, Reading Comprehension	Code Available	1
Benchmarking Neural Topic Models: An Empirical Study	Aug 1, 2021	Benchmarking, Topic Models	Unverified	0
The Effect of Domain and Diacritics in Yoruba–English Neural Machine Translation	Aug 1, 2021	Benchmarking, Machine Translation	Unverified	0
Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes	Aug 1, 2021	Benchmarking, Binary Classification	Unverified	0
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts	Aug 1, 2021	Benchmarking, Binary Classification	Unverified	0
Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference	Aug 1, 2021	Benchmarking, Clustering	Code Available	0
Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability	Aug 1, 2021	Benchmarking	Unverified	0
Improving Model Generalization: A Chinese Named Entity Recognition Case Study	Aug 1, 2021	Benchmarking, Chinese Named Entity Recognition	Unverified	0
SignalGP-Lite: Event Driven Genetic Programming Library for Large-Scale Artificial Life Applications	Aug 1, 2021	Artificial Life, Benchmarking	Code Available	0
Contemporary Symbolic Regression Methods and their Relative Performance	Jul 29, 2021	Benchmarking, parameter estimation	Code Available	1
Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model	Jul 27, 2021	Benchmarking	Unverified	0
AA3DNet: Attention Augmented Real Time 3D Object Detection	Jul 26, 2021	3D Object Detection, Autonomous Vehicles	Unverified	0
Benchmarking AutoML Frameworks for Disease Prediction Using Medical Claims	Jul 22, 2021	AutoML, Benchmarking	Unverified	0
3D fluorescence microscopy data synthesis for segmentation and benchmarking	Jul 21, 2021	Benchmarking	Code Available	0
An Exploration of Exploration: Measuring the ability of lexicase selection to find obscure pathways to optimality	Jul 20, 2021	Benchmarking, Diagnostic	Code Available	0
Learned Sorted Table Search and Static Indexes in Small Model Space	Jul 19, 2021	Benchmarking, Open-Ended Question Answering	Code Available	0
PhD Thesis on Code Modulated Interferometric Imaging System using Phased Arrays	Jul 19, 2021	Benchmarking	Unverified	0
Attribution of Predictive Uncertainties in Classification Models	Jul 19, 2021	Benchmarking, Classification	Code Available	0
Better force fields start with better data -- A data set of cation dipeptide interactions	Jul 19, 2021	Benchmarking	Code Available	0
ECG-Adv-GAN: Detecting ECG Adversarial Examples with Conditional Generative Adversarial Networks	Jul 16, 2021	Benchmarking, Generative Adversarial Network	Unverified	0
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi	Jul 15, 2021	Benchmarking, Deep Reinforcement Learning	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified