Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1401–1425 of 5548 papers

Title	Date	Tasks	Status	Hype
PT-Ranking: A Benchmarking Platform for Neural Learning-to-Rank	Aug 31, 2020	BenchmarkingLearning-To-Rank	CodeCode Available	1
NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size	Aug 28, 2020	BenchmarkingDiagnostic	CodeCode Available	1
Image Colorization: A Survey and Dataset	Aug 25, 2020	BenchmarkingColorization	CodeCode Available	1
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory	Aug 24, 2020	Benchmarking	CodeCode Available	1
Quantitative Survey of the State of the Art in Sign Language Recognition	Aug 22, 2020	BenchmarkingSign Language Recognition	CodeCode Available	1
Automatic sleep stage classification with deep residual networks in a mixed-cohort setting	Aug 21, 2020	Automatic Sleep Stage ClassificationBenchmarking	CodeCode Available	1
ISSAFE: Improving Semantic Segmentation in Accidents by Fusing Event-based Data	Aug 20, 2020	Autonomous VehiclesBenchmarking	CodeCode Available	1
AIPerf: Automated machine learning as an AI-HPC benchmark	Aug 17, 2020	AutoMLBenchmarking	CodeCode Available	1
dMelodies: A Music Dataset for Disentanglement Learning	Jul 29, 2020	BenchmarkingDisentanglement	CodeCode Available	1
WordCraft: An Environment for Benchmarking Commonsense Agents	Jul 17, 2020	BenchmarkingKnowledge Graphs	CodeCode Available	1
Are We There Yet? Evaluating State-of-the-Art Neural Network based Geoparsers Using EUPEG as a Benchmarking Platform	Jul 15, 2020	ArticlesBenchmarking	CodeCode Available	1
Emoji Prediction: Extensions and Benchmarking	Jul 14, 2020	BenchmarkingMulti-Label Classification	CodeCode Available	1
CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning Robustness	Jul 13, 2020	Benchmarking	CodeCode Available	1
Enhancing spatial and textual analysis with EUPEG: an extensible and unified platform for evaluating geoparsers	Jul 9, 2020	Benchmarking	CodeCode Available	1
GAMA: a General Automated Machine learning Assistant	Jul 9, 2020	AutoMLBenchmarking	CodeCode Available	1
IOHanalyzer: Detailed Performance Analyses for Iterative Optimization Heuristics	Jul 8, 2020	Bayesian OptimizationBenchmarking	CodeCode Available	1
RobFR: Benchmarking Adversarial Robustness on Face Recognition	Jul 8, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural Networks	Jul 8, 2020	Bayesian InferenceBenchmarking	CodeCode Available	1
Re-thinking Co-Salient Object Detection	Jul 7, 2020	BenchmarkingCo-Salient Object Detection	CodeCode Available	1
Wiki-CS: A Wikipedia-Based Benchmark for Graph Neural Networks	Jul 6, 2020	ArticlesBenchmarking	CodeCode Available	1
Quo Vadis, Skeleton Action Recognition ?	Jul 4, 2020	Action RecognitionBenchmarking	CodeCode Available	1
Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers	Jul 3, 2020	BenchmarkingDeep Learning	CodeCode Available	1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient	Jul 3, 2020	BenchmarkingMuJoCo	CodeCode Available	1
EndoSLAM Dataset and An Unsupervised Monocular Visual Odometry and Depth Estimation Approach for Endoscopic Videos: Endo-SfMLearner	Jun 30, 2020	BenchmarkingDepth Estimation	CodeCode Available	1
Labelling unlabelled videos from scratch with multi-modal self-supervision	Jun 24, 2020	BenchmarkingClustering	CodeCode Available	1

Show:10 25 50

← PrevPage 57 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified