Benchmarking

Papers

Showing 4151–4200 of 5548 papers

Title	Date	Tasks	Status
TEP-GNN: Accurate Execution Time Prediction of Functional Tests using Graph Neural Networks	Aug 25, 2022	Benchmarking, Graph Neural Network	Unverified
Towards Benchmarking Explainable Artificial Intelligence Methods	Aug 25, 2022	Benchmarking, Explainable artificial intelligence	Unverified
Bugs in the Data: How ImageNet Misrepresents Biodiversity	Aug 24, 2022	Benchmarking, Object Detection	Code Available
StEduCov: An Explored and Benchmarked Dataset on Stance Detection in Tweets towards Online Education during COVID-19 Pandemic	Aug 22, 2022	Benchmarking, Stance Detection	Unverified
MechProNet: Machine Learning Prediction of Mechanical Properties in Metal Additive Manufacturing	Aug 21, 2022	Articles, Benchmarking	Unverified
SIM2E: Benchmarking the Group Equivariant Capability of Correspondence Matching Algorithms	Aug 21, 2022	Benchmarking	Unverified
A biologically-inspired multi-modal evaluation of molecular generative machine learning	Aug 20, 2022	Benchmarking, Drug Discovery	Unverified
Wildfire Forecasting with Satellite Images and Deep Generative Model	Aug 19, 2022	Benchmarking, Video Prediction	Unverified
The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies	Aug 16, 2022	Benchmarking, Management	Unverified
Unsupervised machine learning approach for building composite indicators with fuzzy metrics	Aug 15, 2022	Benchmarking	Unverified
Sensitivity analysis and experimental evaluation of PID-like continuous sliding mode control	Aug 13, 2022	Benchmarking, Sensitivity	Unverified
Benchmarking Joint Face Spoofing and Forgery Detection with Visual and Physiological Cues	Aug 10, 2022	Benchmarking, DeepFake Detection	Unverified
Exact lattice-based stochastic cell culture simulation algorithms incorporating spontaneous and contact-dependent reactions	Aug 9, 2022	Benchmarking, Cultural Vocal Bursts Intensity Prediction	Unverified
fMRI-S4: learning short- and long-range dynamic fMRI dependencies using 1D Convolutions and State Space Models	Aug 8, 2022	Benchmarking, State Space Models	Code Available
QSAM-Net: Rain streak removal by quaternion neural network with self-attention module	Aug 8, 2022	Benchmarking, object-detection	Unverified
SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset	Aug 4, 2022	Benchmarking, Multi-Object Tracking	Unverified
AstroVision: Towards Autonomous Feature Detection and Description for Missions to Small Bodies Using Deep Learning	Aug 3, 2022	Benchmarking	Code Available
Benchmarking zero-shot and few-shot approaches for tokenization, tagging, and dependency parsing of Tagalog text	Aug 3, 2022	Benchmarking, Data Augmentation	Unverified
Binary Classification with Positive Labeling Sources	Aug 2, 2022	Benchmarking, Binary Classification	Unverified
ferret: a Framework for Benchmarking Explainers on Transformers	Aug 2, 2022	Benchmarking, Explainable Artificial Intelligence (XAI)	Code Available
On the role of benchmarking data sets and simulations in method comparison studies	Aug 2, 2022	Benchmarking	Unverified
Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression	Aug 1, 2022	Benchmarking, regression	Unverified
A Case for Dataset Specific Profiling	Aug 1, 2022	Benchmarking, Model Selection	Unverified
On the Evaluation of User Privacy in Deep Neural Networks using Timing Side Channel	Aug 1, 2022	Benchmarking, image-classification	Unverified
Vector-Based Data Improves Left-Right Eye-Tracking Classifier Performance After a Covariate Distributional Shift	Jul 31, 2022	Benchmarking, EEG	Code Available
PASTA: A Dataset for Modeling Participant States in Narratives	Jul 31, 2022	Benchmarking, Common Sense Reasoning	Unverified
Benchmarking Azerbaijani Neural Machine Translation	Jul 29, 2022	Benchmarking, Domain Generalization	Unverified
Content-Aware Differential Privacy with Conditional Invertible Neural Networks	Jul 29, 2022	Benchmarking	Code Available
Towards Large-Scale Small Object Detection: Survey and Benchmarks	Jul 28, 2022	Benchmarking, Object	Unverified
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks	Jul 27, 2022	Adversarial Robustness, Benchmarking	Unverified
3DOS: Towards 3D Open Set Learning -- Benchmarking and Understanding Semantic Novelty Detection on Point Clouds	Jul 23, 2022	Benchmarking, Novelty Detection	Code Available
Rethinking the Reference-based Distinctive Image Captioning	Jul 22, 2022	Attribute, Benchmarking	Code Available
PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation	Jul 22, 2022	Benchmarking, Domain Adaptation	Unverified
Benchmarking tools for a priori identifiability analysis	Jul 20, 2022	Benchmarking	Code Available
Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications	Jul 20, 2022	Benchmarking	Code Available
Benchmarking Transformers-based models on French Spoken Language Understanding tasks	Jul 19, 2022	Benchmarking, Spoken Language Understanding	Unverified
The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks	Jul 18, 2022	Benchmarking	Code Available
Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence Classification	Jul 18, 2022	Benchmarking, BIG-bench Machine Learning	Code Available
GOAL: Towards Benchmarking Few-Shot Sports Game Summarization	Jul 18, 2022	Benchmarking	Code Available
Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey	Jul 14, 2022	Benchmarking, BIG-bench Machine Learning	Unverified
Immunofluorescence Capillary Imaging Segmentation: Cases Study	Jul 14, 2022	Benchmarking, Image Segmentation	Code Available
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification	Jul 13, 2022	Benchmarking, Label Error Detection	Code Available
Slot Filling for Extracting Reskilling and Upskilling Options from the Web	Jul 11, 2022	Benchmarking, Entity Linking	Code Available
A novel evaluation methodology for supervised Feature Ranking algorithms	Jul 9, 2022	Benchmarking, Feature Importance	Code Available
Ensemble random forest filter: An alternative to the ensemble Kalman filter for inverse modeling	Jul 8, 2022	Benchmarking	Unverified
OVQA: A Clinically Generated Visual Question Answering Dataset	Jul 7, 2022	Benchmarking, Medical Visual Question Answering	Unverified
Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems	Jul 7, 2022	Benchmarking	Unverified
Identifying the Context Shift between Test Benchmarks and Production Data	Jul 3, 2022	Benchmarking, BIG-bench Machine Learning	Unverified
Towards Toxic Positivity Detection	Jul 1, 2022	Benchmarking, Classification	Unverified
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles	Jul 1, 2022	Abstractive Text Summarization, Articles	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified