Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4426–4450 of 5548 papers

Title	Date	Tasks	Status
Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines	Dec 1, 2021	Adversarial RobustnessBenchmarking	—Unverified
Synthetic weather radar using hybrid quantum-classical machine learning	Nov 30, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
An implementation of the "Guess who?" game using CLIP	Nov 30, 2021	Benchmarking	CodeCode Available
Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking	Nov 30, 2021	BenchmarkingNatural Language Understanding	—Unverified
HRNET: AI on Edge for mask detection and social distancing	Nov 30, 2021	BenchmarkingEdge-computing	CodeCode Available
TinyML Platforms Benchmarking	Nov 30, 2021	Benchmarking	—Unverified
An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments	Nov 29, 2021	BenchmarkingVisual Navigation	—Unverified
OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images	Nov 29, 2021	3D Pose EstimationBenchmarking	—Unverified
3D Compositional Zero-shot Learning with DeCompositional Consensus	Nov 29, 2021	BenchmarkingCompositional Zero-Shot Learning	—Unverified
EffCNet: An Efficient CondenseNet for Image Classification on NXP BlueBox	Nov 28, 2021	BenchmarkingClassification	—Unverified
Benchmarking Shadow Removal for Facial Landmark Detection and Beyond	Nov 27, 2021	BenchmarkingBlocking	—Unverified
Learning to Transfer for Traffic Forecasting via Multi-task Learning	Nov 27, 2021	BenchmarkingDomain Adaptation	CodeCode Available
Using Color To Identify Insider Threats	Nov 25, 2021	Benchmarking	CodeCode Available
A War Beyond Deepfake: Benchmarking Facial Counterfeits and Countermeasures	Nov 25, 2021	BenchmarkingDeepFake Detection	—Unverified
A Modular Framework for Centrality and Clustering in Complex Networks	Nov 23, 2021	BenchmarkingClustering	—Unverified
RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR	Nov 23, 2021	BenchmarkingComputed Tomography (CT)	—Unverified
Filter Methods for Feature Selection in Supervised Machine Learning Applications -- Review and Benchmark	Nov 23, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified
Novel Real-Time EMT-TS Modeling Architecture for Feeder Blackstart Simulations	Nov 19, 2021	Benchmarking	—Unverified
CLMB: deep contrastive learning for robust metagenomic binning	Nov 18, 2021	BenchmarkingContrastive Learning	CodeCode Available
Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion Algorithms	Nov 17, 2021	Benchmarking	—Unverified
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding	Nov 16, 2021	BenchmarkingNatural Language Understanding	—Unverified
MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization	Nov 16, 2021	Benchmarkingdialogue summary	—Unverified
Fantastic Questions and Where to Find Them: FairytaleQA--An Authentic Dataset for Narrative Comprehension	Nov 16, 2021	BenchmarkingQuestion Answering	—Unverified
Mukayese: Turkish NLP Strikes Back	Nov 16, 2021	BenchmarkingLanguage Modeling	—Unverified
Multiclass Optimal Classification Trees with SVM-splits	Nov 16, 2021	BenchmarkingClassification	—Unverified

Show:10 25 50

← PrevPage 178 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified