Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3951–4000 of 5548 papers

Title	Date	Tasks	Status
Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields	Mar 2, 2023	Benchmarking	—Unverified
Learning to Adapt to Online Streams with Distribution Shifts	Mar 2, 2023	BenchmarkingMeta-Learning	—Unverified
Benchmarking Self-Supervised Contrastive Learning Methods for Image-Based Plant Phenotyping	Mar 1, 2023	BenchmarkingContrastive Learning	CodeCode Available
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking	Feb 28, 2023	Adversarial RobustnessBenchmarking	—Unverified
Benchmarking Deepart Detection	Feb 28, 2023	BenchmarkingDeepFake Detection	—Unverified
Predicting the Performance of a Computing System with Deep Networks	Feb 27, 2023	Benchmarking	—Unverified
Benchmarking of Cancelable Biometrics for Deep Templates	Feb 26, 2023	BenchmarkingBinarization	—Unverified
STA: Self-controlled Text Augmentation for Improving Text Classifications	Feb 24, 2023	BenchmarkingText Augmentation	CodeCode Available
Dermatological Diagnosis Explainability Benchmark for Convolutional Neural Networks	Feb 23, 2023	BenchmarkingMedical Diagnosis	CodeCode Available
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views	Feb 23, 2023	Benchmarking	—Unverified
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks	Feb 21, 2023	Benchmarking	—Unverified
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines	Feb 21, 2023	Benchmarkingwhole slide images	—Unverified
An Efficient Two-stage Gradient Boosting Framework for Short-term Traffic State Estimation	Feb 21, 2023	BenchmarkingState Estimation	CodeCode Available
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry	Feb 20, 2023	BenchmarkingManagement	—Unverified
Arena-Rosnav 2.0: A Development and Benchmarking Platform for Robot Navigation in Highly Dynamic Environments	Feb 20, 2023	BenchmarkingRobot Navigation	CodeCode Available
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK	Feb 16, 2023	BenchmarkingKnowledge Distillation	—Unverified
Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking	Feb 16, 2023	Benchmarkingcounterfactual	—Unverified
Benchmarking Continuous Time Models for Predicting Multiple Sclerosis Progression	Feb 15, 2023	Benchmarking	—Unverified
Efficiency in European Air Traffic Management -- A Fundamental Analysis of Data, Models, and Methods	Feb 15, 2023	BenchmarkingDecision Making	—Unverified
Model-Based Underwater 6D Pose Estimation from RGB	Feb 14, 2023	2D Object Detection6D Pose Estimation	—Unverified
A Neuromorphic Dataset for Object Segmentation in Indoor Cluttered Environment	Feb 13, 2023	BenchmarkingSegmentation	CodeCode Available
Deep Imputation of Missing Values in Time Series Health Data: A Review with Benchmarking	Feb 10, 2023	BenchmarkingDeep Learning	—Unverified
AI Sound Recognition on Asthma Medication Adherence: Evaluation With the RDA Benchmark Suite	Feb 8, 2023	BenchmarkingManagement	CodeCode Available
CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models	Feb 8, 2023	BenchmarkingFew-Shot Learning	—Unverified
Participatory Personalization in Classification	Feb 8, 2023	BenchmarkingClassification	—Unverified
Arena-Web -- A Web-based Development and Benchmarking Platform for Autonomous Navigation Approaches	Feb 6, 2023	Autonomous NavigationBenchmarking	—Unverified
NA-SODINN: a deep learning algorithm for exoplanet image detection based on residual noise regimes	Feb 6, 2023	BenchmarkingSpecificity	—Unverified
Stability Constrained OPF in Microgrids: A Chance Constrained Optimization Framework with Non-Gaussian Uncertainty	Feb 4, 2023	Benchmarking	—Unverified
Benchmarking sparse system identification with low-dimensional chaos	Feb 4, 2023	Benchmarking	—Unverified
Characterization of Constrained Continuous Multiobjective Optimization Problems: A Performance Space Perspective	Feb 4, 2023	BenchmarkingMultiobjective Optimization	—Unverified
An Operational Perspective to Fairness Interventions: Where and How to Intervene	Feb 3, 2023	BenchmarkingFairness	—Unverified
Benchmarking Probabilistic Deep Learning Methods for License Plate Recognition	Feb 2, 2023	BenchmarkingDeep Learning	CodeCode Available
Data-driven Approach for Static Hedging of Exchange Traded Options	Feb 1, 2023	BenchmarkingInterpretable Machine Learning	—Unverified
Continuous U-Net: Faster, Greater and Noiseless	Feb 1, 2023	BenchmarkingDecoder	—Unverified
Enhancing Hyper-To-Real Space Projections Through Euclidean Norm Meta-Heuristic Optimization	Jan 31, 2023	Benchmarking	CodeCode Available
Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST)	Jan 31, 2023	BenchmarkingModel Predictive Control	—Unverified
Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022	Jan 31, 2023	Action DetectionBenchmarking	CodeCode Available
Population-wise Labeling of Sulcal Graphs using Multi-graph Matching	Jan 31, 2023	BenchmarkingGraph Matching	CodeCode Available
Benchmarking optimality of time series classification methods in distinguishing diffusions	Jan 30, 2023	BenchmarkingGaussian Processes	CodeCode Available
Cross-Subject Deep Transfer Models for Evoked Potentials in Brain-Computer Interface	Jan 29, 2023	BenchmarkingBrain Computer Interface	—Unverified
Quality Indicators for Preference-based Evolutionary Multi-objective Optimization Using a Reference Point: A Review and Analysis	Jan 28, 2023	BenchmarkingDecision Making	CodeCode Available
Heterogeneous Datasets for Federated Survival Analysis Simulation	Jan 28, 2023	BenchmarkingFederated Learning	CodeCode Available
Task-Agnostic Graph Neural Network Evaluation via Adversarial Collaboration	Jan 27, 2023	BenchmarkingGraph Classification	CodeCode Available
A Systematic Review of Green AI	Jan 26, 2023	Benchmarking	CodeCode Available
Out of Distribution Performance of State of Art Vision Model	Jan 25, 2023	Benchmarking	—Unverified
Towards Robust Metrics for Concept Representation Evaluation	Jan 25, 2023	BenchmarkingDisentanglement	CodeCode Available
SpaceTx: A Roadmap for Benchmarking Spatial Transcriptomics Exploration of the Brain	Jan 20, 2023	BenchmarkingCell Segmentation	—Unverified
Job recommendations: benchmarking of collaborative filtering methods for classifieds	Jan 19, 2023	BenchmarkingCollaborative Filtering	—Unverified
Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applications	Jan 19, 2023	BenchmarkingGPU	CodeCode Available
Vision Learners Meet Web Image-Text Pairs	Jan 17, 2023	BenchmarkingSelf-Supervised Learning	—Unverified

Show:10 25 50

← PrevPage 80 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified