Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5451–5475 of 5548 papers

Title	Date	Tasks	Status
There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction	Oct 7, 2016	BenchmarkingGrammatical Error Correction	CodeCode Available
Technical Report on the CleverHans v2.1.0 Adversarial Examples Library	Oct 3, 2016	Adversarial AttackAdversarial Defense	CodeCode Available
Estimating transmission from genetic and epidemiological data: a metric to compare transmission trees	Sep 28, 2016	Benchmarking	—Unverified
Geometry-Based Next Frame Prediction from Monocular Video	Sep 20, 2016	Autonomous DrivingBenchmarking	—Unverified
Quantum-Assisted Learning of Hardware-Embedded Probabilistic Graphical Models	Sep 8, 2016	BenchmarkingBIG-bench Machine Learning	—Unverified
Joint Online Spoken Language Understanding and Language Modeling with Recurrent Neural Networks	Sep 6, 2016	BenchmarkingIntent Detection	—Unverified
Benchmarking State-of-the-Art Deep Learning Software Tools	Aug 25, 2016	BenchmarkingCPU	—Unverified
Benchmarking confound regression strategies for the control of motion artifact in studies of functional connectivity	Aug 11, 2016	BenchmarkingFunctional Connectivity	—Unverified
Haze Visibility Enhancement: A Survey and Quantitative Benchmarking	Jul 21, 2016	BenchmarkingSurvey	—Unverified
Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking	Jul 21, 2016	Action RecognitionBenchmarking	—Unverified
Sparse Representation-Based Classification: Orthogonal Least Squares or Orthogonal Matching Pursuit?	Jul 18, 2016	BenchmarkingClassification	—Unverified
Hierarchical Data Generator based on Tree-Structured Stick Breaking Process for Benchmarking Clustering Methods	Jun 17, 2016	BenchmarkingClustering	—Unverified
Spatially Binned ROC: A Comprehensive Saliency Metric	Jun 1, 2016	Benchmarking	—Unverified
Extraction of clinical information from the non-invasive fetal electrocardiogram	May 27, 2016	BenchmarkingHeart Rate Variability	—Unverified
Yum-me: A Personalized Nutrient-based Meal Recommender System	May 25, 2016	BenchmarkingRecommendation Systems	CodeCode Available
Coupling volume-excluding compartment-based models of diffusion at different scales: Voronoi and pseudo-compartment approaches	May 24, 2016	BenchmarkingBlocking	—Unverified
BMOBench: Black-Box Multi-Objective Optimization Benchmarking Platform	May 23, 2016	Benchmarking	—Unverified
Fine-Grained Classification of Pedestrians in Video: Benchmark and State of the Art	May 20, 2016	BenchmarkingGeneral Classification	—Unverified
Movie Description	May 12, 2016	Benchmarking	—Unverified
COCO: Performance Assessment	May 11, 2016	Benchmarking	CodeCode Available
Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark	May 9, 2016	BenchmarkingEmotion Recognition	CodeCode Available
Anytime Bi-Objective Optimization with a Hybrid Multi-Objective CMA-ES (HMO-CMA-ES)	May 9, 2016	Benchmarking	—Unverified
Active Learning for Community Detection in Stochastic Block Models	May 8, 2016	Active LearningBenchmarking	—Unverified
Benchmarking Lexical Simplification Systems	May 1, 2016	BenchmarkingLexical Simplification	—Unverified
JATE 2.0: Java Automatic Term Extraction with Apache Solr	May 1, 2016	BenchmarkingTerm Extraction	CodeCode Available

Show:10 25 50

← PrevPage 219 of 222Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified