Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3426–3450 of 5548 papers

Title	Date	Tasks	Status	Hype
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation	Mar 4, 2023	BenchmarkingGPU	CodeCode Available	2
Benchmarking framework for machine learning classification from fNIRS data	Mar 3, 2023	BenchmarkingBrain Computer Interface	CodeCode Available	0
Benchmarking White Blood Cell Classification Under Domain Shift	Mar 3, 2023	BenchmarkingClassification	CodeCode Available	0
Data-Efficient Training of CNNs and Transformers with Coresets: A Stability Perspective	Mar 3, 2023	BenchmarkingImage Classification	CodeCode Available	0
POPGym: Benchmarking Partially Observable Reinforcement Learning	Mar 3, 2023	BenchmarkingGPU	CodeCode Available	2
Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields	Mar 2, 2023	Benchmarking	—Unverified	0
Learning to Adapt to Online Streams with Distribution Shifts	Mar 2, 2023	BenchmarkingMeta-Learning	—Unverified	0
Benchmarking Self-Supervised Contrastive Learning Methods for Image-Based Plant Phenotyping	Mar 1, 2023	BenchmarkingContrastive Learning	CodeCode Available	0
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking	Feb 28, 2023	Adversarial RobustnessBenchmarking	—Unverified	0
Benchmarking Deepart Detection	Feb 28, 2023	BenchmarkingDeepFake Detection	—Unverified	0
Predicting the Performance of a Computing System with Deep Networks	Feb 27, 2023	Benchmarking	—Unverified	0
Benchmarking of Cancelable Biometrics for Deep Templates	Feb 26, 2023	BenchmarkingBinarization	—Unverified	0
STA: Self-controlled Text Augmentation for Improving Text Classifications	Feb 24, 2023	BenchmarkingText Augmentation	CodeCode Available	0
Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views	Feb 23, 2023	Benchmarking	—Unverified	0
What Can We Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet Classifiers	Feb 23, 2023	BenchmarkingOut-of-Distribution Detection	CodeCode Available	1
Revisiting the Gumbel-Softmax in MADDPG	Feb 23, 2023	BenchmarkingMulti-agent Reinforcement Learning	CodeCode Available	1
A framework for benchmarking class-out-of-distribution detection and its application to ImageNet	Feb 23, 2023	BenchmarkingKnowledge Distillation	CodeCode Available	1
Dermatological Diagnosis Explainability Benchmark for Convolutional Neural Networks	Feb 23, 2023	BenchmarkingMedical Diagnosis	CodeCode Available	0
MultiRobustBench: Benchmarking Robustness Against Multiple Attacks	Feb 21, 2023	Benchmarking	—Unverified	0
An Efficient Two-stage Gradient Boosting Framework for Short-term Traffic State Estimation	Feb 21, 2023	BenchmarkingState Estimation	CodeCode Available	0
Time to Embrace Natural Language Processing (NLP)-based Digital Pathology: Benchmarking NLP- and Convolutional Neural Network-based Deep Learning Pipelines	Feb 21, 2023	Benchmarkingwhole slide images	—Unverified	0
Determinants of Performance in European ATM -- How to Analyze a Diverse Industry	Feb 20, 2023	BenchmarkingManagement	—Unverified	0
Arena-Rosnav 2.0: A Development and Benchmarking Platform for Robot Navigation in Highly Dynamic Environments	Feb 20, 2023	BenchmarkingRobot Navigation	CodeCode Available	0
Fuzzy Knowledge Distillation from High-Order TSK to Low-Order TSK	Feb 16, 2023	BenchmarkingKnowledge Distillation	—Unverified	0
Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking	Feb 16, 2023	Benchmarkingcounterfactual	—Unverified	0

Show:10 25 50

← PrevPage 138 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified