Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4176–4200 of 5548 papers

Title	Date	Tasks	Status
PASTA: A Dataset for Modeling Participant States in Narratives	Jul 31, 2022	BenchmarkingCommon Sense Reasoning	—Unverified
Benchmarking Azerbaijani Neural Machine Translation	Jul 29, 2022	BenchmarkingDomain Generalization	—Unverified
Content-Aware Differential Privacy with Conditional Invertible Neural Networks	Jul 29, 2022	Benchmarking	CodeCode Available
Towards Large-Scale Small Object Detection: Survey and Benchmarks	Jul 28, 2022	BenchmarkingObject	—Unverified
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks	Jul 27, 2022	Adversarial RobustnessBenchmarking	—Unverified
3DOS: Towards 3D Open Set Learning -- Benchmarking and Understanding Semantic Novelty Detection on Point Clouds	Jul 23, 2022	BenchmarkingNovelty Detection	CodeCode Available
Rethinking the Reference-based Distinctive Image Captioning	Jul 22, 2022	AttributeBenchmarking	CodeCode Available
PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation	Jul 22, 2022	BenchmarkingDomain Adaptation	—Unverified
Benchmarking tools for a priori identifiability analysis	Jul 20, 2022	Benchmarking	CodeCode Available
Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications	Jul 20, 2022	Benchmarking	CodeCode Available
Benchmarking Transformers-based models on French Spoken Language Understanding tasks	Jul 19, 2022	BenchmarkingSpoken Language Understanding	—Unverified
The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks	Jul 18, 2022	Benchmarking	CodeCode Available
Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence Classification	Jul 18, 2022	BenchmarkingBIG-bench Machine Learning	CodeCode Available
GOAL: Towards Benchmarking Few-Shot Sports Game Summarization	Jul 18, 2022	Benchmarking	CodeCode Available
Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey	Jul 14, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified
Immunofluorescence Capillary Imaging Segmentation: Cases Study	Jul 14, 2022	BenchmarkingImage Segmentation	CodeCode Available
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification	Jul 13, 2022	BenchmarkingLabel Error Detection	CodeCode Available
Slot Filling for Extracting Reskilling and Upskilling Options from the Web	Jul 11, 2022	BenchmarkingEntity Linking	CodeCode Available
A novel evaluation methodology for supervised Feature Ranking algorithms	Jul 9, 2022	BenchmarkingFeature Importance	CodeCode Available
Ensemble random forest filter: An alternative to the ensemble Kalman filter for inverse modeling	Jul 8, 2022	Benchmarking	—Unverified
OVQA: A Clinically Generated Visual Question Answering Dataset	Jul 7, 2022	BenchmarkingMedical Visual Question Answering	—Unverified
Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems	Jul 7, 2022	Benchmarking	—Unverified
Identifying the Context Shift between Test Benchmarks and Production Data	Jul 3, 2022	BenchmarkingBIG-bench Machine Learning	—Unverified
Towards Toxic Positivity Detection	Jul 1, 2022	BenchmarkingClassification	—Unverified
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles	Jul 1, 2022	Abstractive Text SummarizationArticles	—Unverified

Show:10 25 50

← PrevPage 168 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified