Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3951–3975 of 5548 papers

Title	Date	Tasks	Status
Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields	Mar 2, 2023	Benchmarking	—Unverified
Learning to Adapt to Online Streams with Distribution Shifts	Mar 2, 2023	BenchmarkingMeta-Learning	—Unverified
Benchmarking Self-Supervised Contrastive Learning Methods for Image-Based Plant Phenotyping	Mar 1, 2023	BenchmarkingContrastive Learning	CodeCode Available
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking	Feb 28, 2023	Adversarial RobustnessBenchmarking	—Unverified
Benchmarking Deepart Detection	Feb 28, 2023	BenchmarkingDeepFake Detection	—Unverified
Predicting the Performance of a Computing System with Deep Networks	Feb 27, 2023	Benchmarking	—Unverified
Benchmarking of Cancelable Biometrics for Deep Templates	Feb 26, 2023	BenchmarkingBinarization	—Unverified
STA: Self-controlled Text Augmentation for Improving Text Classifications	Feb 24, 2023	BenchmarkingText Augmentation	CodeCode Available
Dermatological Diagnosis Explainability Benchmark for Convolutional Neural Networks	Feb 23, 2023	BenchmarkingMedical Diagnosis	CodeCode Available
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views	Feb 23, 2023	Benchmarking	—Unverified
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks	Feb 21, 2023	Benchmarking	—Unverified
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines	Feb 21, 2023	Benchmarkingwhole slide images	—Unverified
An Efficient Two-stage Gradient Boosting Framework for Short-term Traffic State Estimation	Feb 21, 2023	BenchmarkingState Estimation	CodeCode Available
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry	Feb 20, 2023	BenchmarkingManagement	—Unverified
Arena-Rosnav 2.0: A Development and Benchmarking Platform for Robot Navigation in Highly Dynamic Environments	Feb 20, 2023	BenchmarkingRobot Navigation	CodeCode Available
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK	Feb 16, 2023	BenchmarkingKnowledge Distillation	—Unverified
Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking	Feb 16, 2023	Benchmarkingcounterfactual	—Unverified
Benchmarking Continuous Time Models for Predicting Multiple Sclerosis Progression	Feb 15, 2023	Benchmarking	—Unverified
Efficiency in European Air Traffic Management -- A Fundamental Analysis of Data, Models, and Methods	Feb 15, 2023	BenchmarkingDecision Making	—Unverified
Model-Based Underwater 6D Pose Estimation from RGB	Feb 14, 2023	2D Object Detection6D Pose Estimation	—Unverified
A Neuromorphic Dataset for Object Segmentation in Indoor Cluttered Environment	Feb 13, 2023	BenchmarkingSegmentation	CodeCode Available
Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking	Feb 10, 2023	BenchmarkingDeep Learning	—Unverified
AI Sound Recognition on Asthma Medication Adherence: Evaluation With the RDA Benchmark Suite	Feb 8, 2023	BenchmarkingManagement	CodeCode Available
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models	Feb 8, 2023	BenchmarkingFew-Shot Learning	—Unverified
Participatory Personalization in Classification	Feb 8, 2023	BenchmarkingClassification	—Unverified

Show:10 25 50

← PrevPage 159 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified