Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4101–4150 of 5548 papers

Title	Date	Tasks	Status	Hype
Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation	Dec 16, 2021	BenchmarkingDeep Reinforcement Learning	—Unverified	0
High-Dimensional Inference in Bayesian Networks	Dec 16, 2021	BenchmarkingVocal Bursts Intensity Prediction	CodeCode Available	1
Logically at Factify 2022: Multimodal Fact Verification	Dec 16, 2021	BenchmarkingFact Checking	—Unverified	0
A Modular Workflow for Performance Benchmarking of Neuronal Network Simulations	Dec 16, 2021	Benchmarking	CodeCode Available	0
On the Use of Quality Diversity Algorithms for The Traveling Thief Problem	Dec 16, 2021	BenchmarkingDiversity	—Unverified	0
Boosting Neural Image Compression for Machines Using Latent Space Masking	Dec 15, 2021	BenchmarkingImage Compression	CodeCode Available	1
On the Value of ML Models	Dec 13, 2021	Benchmarking	—Unverified	0
GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal Segmentation	Dec 12, 2021	BenchmarkingInstance Segmentation	CodeCode Available	0
Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets	Dec 10, 2021	Benchmarking	CodeCode Available	1
Learning Representations with Contrastive Self-Supervised Learning for Histopathology Applications	Dec 10, 2021	BenchmarkingContrastive Learning	CodeCode Available	1
Label, Verify, Correct: A Simple Few Shot Object Detection Method	Dec 10, 2021	BenchmarkingFew-Shot Object Detection	CodeCode Available	1
7th AI Driving Olympics: 1st Place Report for Panoptic Tracking	Dec 9, 2021	BenchmarkingPanoptic Segmentation	—Unverified	0
GreenPCO: An Unsupervised Lightweight Point Cloud Odometry Method	Dec 8, 2021	BenchmarkingObject	—Unverified	0
Object Shape Error Response Using Bayesian 3-D Convolutional Neural Networks for Assembly Systems With Compliant Parts	Dec 8, 2021	3D Shape ModelingBenchmarking	CodeCode Available	1
HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder	Dec 6, 2021	BenchmarkingGraph Learning	CodeCode Available	1
Neuro-Symbolic Inductive Logic Programming with Logical Neural Networks	Dec 6, 2021	BenchmarkingInductive logic programming	CodeCode Available	1
BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale	Dec 4, 2021	BenchmarkingHyperparameter Optimization	CodeCode Available	1
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research	Dec 3, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified	0
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation	Dec 2, 2021	BenchmarkingImage Generation	CodeCode Available	1
CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer	Dec 2, 2021	BenchmarkingOrdinal Classification	CodeCode Available	1
NEORL: NeuroEvolution Optimization with Reinforcement Learning	Dec 1, 2021	Benchmarkingglobal-optimization	CodeCode Available	1
Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines	Dec 1, 2021	Adversarial RobustnessBenchmarking	—Unverified	0
MC-Blur: A Comprehensive Benchmark for Image Deblurring	Dec 1, 2021	BenchmarkingDeblurring	CodeCode Available	1
Neural Regression, Representational Similarity, Model Zoology & Neural Taskonomy at Scale in Rodent Visual Cortex	Dec 1, 2021	BenchmarkingObject Recognition	CodeCode Available	1
TinyML Platforms Benchmarking	Nov 30, 2021	Benchmarking	—Unverified	0
An implementation of the "Guess who?" game using CLIP	Nov 30, 2021	Benchmarking	CodeCode Available	0
Synthetic weather radar using hybrid quantum-classical machine learning	Nov 30, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking	Nov 30, 2021	BenchmarkingNatural Language Understanding	—Unverified	0
HRNET: AI on Edge for mask detection and social distancing	Nov 30, 2021	BenchmarkingEdge-computing	CodeCode Available	0
3D Compositional Zero-shot Learning with DeCompositional Consensus	Nov 29, 2021	BenchmarkingCompositional Zero-Shot Learning	—Unverified	0
ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models	Nov 29, 2021	BenchmarkingPhysical Simulations	CodeCode Available	1
OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images	Nov 29, 2021	3D Pose EstimationBenchmarking	—Unverified	0
An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments	Nov 29, 2021	BenchmarkingVisual Navigation	—Unverified	0
EffCNet: An Efficient CondenseNet for Image Classification on NXP BlueBox	Nov 28, 2021	BenchmarkingClassification	—Unverified	0
Learning to Transfer for Traffic Forecasting via Multi-task Learning	Nov 27, 2021	BenchmarkingDomain Adaptation	CodeCode Available	0
Benchmarking Shadow Removal for Facial Landmark Detection and Beyond	Nov 27, 2021	BenchmarkingBlocking	—Unverified	0
Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces	Nov 27, 2021	BenchmarkingGraph Attention	CodeCode Available	1
A War Beyond Deepfake: Benchmarking Facial Counterfeits and Countermeasures	Nov 25, 2021	BenchmarkingDeepFake Detection	—Unverified	0
Using Color To Identify Insider Threats	Nov 25, 2021	Benchmarking	CodeCode Available	0
Investigating Tradeoffs in Real-World Video Super-Resolution	Nov 24, 2021	BenchmarkingSuper-Resolution	CodeCode Available	2
EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search	Nov 24, 2021	BenchmarkingNeural Architecture Search	CodeCode Available	1
RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR	Nov 23, 2021	BenchmarkingComputed Tomography (CT)	—Unverified	0
A Modular Framework for Centrality and Clustering in Complex Networks	Nov 23, 2021	BenchmarkingClustering	—Unverified	0
Filter Methods for Feature Selection in Supervised Machine Learning Applications -- Review and Benchmark	Nov 23, 2021	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Evaluating Adversarial Attacks on ImageNet: A Reality Check on Misclassification Classes	Nov 22, 2021	Benchmarking	CodeCode Available	1
Benchmarking Detection Transfer Learning with Vision Transformers	Nov 22, 2021	Benchmarkingobject-detection	CodeCode Available	1
FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks	Nov 22, 2021	BenchmarkingFederated Learning	CodeCode Available	1
Benchmarking emergency department triage prediction models with machine learning and large public electronic health records	Nov 22, 2021	Benchmarking	CodeCode Available	1
GRecX: An Efficient and Unified Benchmark for GNN-based Recommendation	Nov 19, 2021	BenchmarkingManagement	CodeCode Available	1
Novel Real-Time EMT-TS Modeling Architecture for Feeder Blackstart Simulations	Nov 19, 2021	Benchmarking	—Unverified	0

Show:10 25 50

← PrevPage 83 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified