Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5401–5450 of 5548 papers

Title	Date	Tasks	Status	Hype
Deep learning for extracting protein-protein interactions from biomedical literature	Jun 5, 2017	BenchmarkingCross-corpus	—Unverified	0
CRNN: A Joint Neural Network for Redundancy Detection	Jun 4, 2017	BenchmarkingGeneral Classification	CodeCode Available	0
Discovering Visual Concept Structure with Sparse and Incomplete Tags	May 30, 2017	BenchmarkingClustering	—Unverified	0
Classification and Retrieval of Digital Pathology Scans: A New Dataset	May 22, 2017	BenchmarkingGeneral Classification	—Unverified	0
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning	May 21, 2017	BenchmarkingDecision Making	—Unverified	0
WebVision Challenge: Visual Learning and Understanding With Web Data	May 16, 2017	Benchmarkingimage-classification	—Unverified	0
Saliency Benchmarking Made Easy: Separating Models, Maps and Metrics	Apr 27, 2017	AllBenchmarking	—Unverified	0
Reconstructing antibody repertoires from error-prone immunosequencing datasets	Apr 24, 2017	Benchmarking	—Unverified	0
Computer Vision for Autonomous Vehicles: Problems, Datasets and State of the Art	Apr 18, 2017	Autonomous DrivingAutonomous Vehicles	—Unverified	0
LibOPT: An Open-Source Platform for Fast Prototyping Soft Optimization Techniques	Apr 18, 2017	Benchmarking	CodeCode Available	0
Embodied Artificial Intelligence through Distributed Adaptive Control: An Integrated Framework	Apr 5, 2017	BenchmarkingBoard Games	—Unverified	0
A Comparison of Directional Distances for Hand Pose Estimation	Apr 3, 2017	BenchmarkingHand Pose Estimation	—Unverified	0
Benchmarking Joint Lexical and Syntactic Analysis on Multiword-Rich Data	Apr 1, 2017	BenchmarkingDependency Parsing	—Unverified	0
A Characterization Study of Arabic Twitter Data with a Benchmarking for State-of-the-Art Opinion Mining Models	Apr 1, 2017	BenchmarkingFeature Engineering	—Unverified	0
A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages	Apr 1, 2017	BenchmarkingMachine Translation	—Unverified	0
Efficient Benchmarking of NLP APIs using Multi-armed Bandits	Apr 1, 2017	BenchmarkingMulti-Armed Bandits	—Unverified	0
Configurable 3D Scene Synthesis and 2D Image Rendering with Per-Pixel Ground Truth using Stochastic Grammars	Apr 1, 2017	BenchmarkingObject	—Unverified	0
Efficient Benchmarking of Algorithm Configuration Procedures via Model-Based Surrogates	Mar 30, 2017	BenchmarkingHyperparameter Optimization	—Unverified	0
Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network	Mar 28, 2017	BenchmarkingClustering	—Unverified	0
Efficient Processing of Deep Neural Networks: A Tutorial and Survey	Mar 27, 2017	Benchmarkingspeech-recognition	—Unverified	0
Multitask learning and benchmarking with clinical time series data	Mar 22, 2017	BenchmarkingBIG-bench Machine Learning	CodeCode Available	1
Computer Aided Detection of Anemia-like Pallor	Mar 17, 2017	BenchmarkingClassification	—Unverified	0
A New Evaluation Protocol and Benchmarking Results for Extendable Cross-media Retrieval	Mar 10, 2017	BenchmarkingImage Retrieval	—Unverified	0
Meet Spinky: An Open-Source Spindle and K-Complex Detection Toolbox Validated on the Open-Access Montreal Archive of Sleep Studies (MASS).	Mar 2, 2017	BenchmarkingEEG	CodeCode Available	0
PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison	Mar 1, 2017	BenchmarkingBIG-bench Machine Learning	CodeCode Available	0
A Dataset for Developing and Benchmarking Active Vision	Feb 27, 2017	BenchmarkingGeneral Classification	—Unverified	0
Support Vector Machines and generalisation in HEP	Feb 15, 2017	Benchmarking	—Unverified	0
FERA 2017 - Addressing Head Pose in the Third Facial Expression Recognition and Analysis Challenge	Feb 14, 2017	BenchmarkingFacial Action Unit Detection	—Unverified	0
MORSE: Semantic-ally Drive-n MORpheme SEgment-er	Feb 7, 2017	Benchmarking	—Unverified	0
The biglasso Package: A Memory- and Computation-Efficient Solver for Lasso Model Fitting with Big Data in R	Jan 20, 2017	Benchmarking	CodeCode Available	0
Deep Learning Logo Detection with Data Expansion by Synthesising Context	Dec 29, 2016	BenchmarkingDeep Learning	—Unverified	0
Jointly learning heterogeneous features for rgb-d activity recognition	Dec 15, 2016	Activity RecognitionBenchmarking	—Unverified	0
Multiple Instance Learning: A Survey of Problem Characteristics and Applications	Dec 11, 2016	BenchmarkingDocument Classification	CodeCode Available	0
pke: an open source python-based keyphrase extraction toolkit	Dec 1, 2016	BenchmarkingKeyphrase Extraction	CodeCode Available	0
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset	Nov 28, 2016	BenchmarkingMachine Reading Comprehension	CodeCode Available	1
Person Re-Identification by Unsupervised Video Matching	Nov 25, 2016	BenchmarkingDynamic Time Warping	—Unverified	0
'Part'ly first among equals: Semantic part-based benchmarking for state-of-the-art object recognition systems	Nov 23, 2016	BenchmarkingObject	—Unverified	0
CMOS based image cytometry for detection of phytoplankton in ballast water	Nov 21, 2016	Benchmarking	—Unverified	0
The Freiburg Groceries Dataset	Nov 17, 2016	BenchmarkingBIG-bench Machine Learning	CodeCode Available	0
Benchmarking inverse statistical approaches for protein structure and design with exactly solvable models	Nov 15, 2016	Benchmarking	—Unverified	0
Benchmarking Quantum Hardware for Training of Fully Visible Boltzmann Machines	Nov 14, 2016	Benchmarking	—Unverified	0
XCSP3: An Integrated Format for Benchmarking Combinatorial Constrained Problems	Nov 10, 2016	Benchmarking	—Unverified	0
A Comparison of Word Embeddings for English and Cross-Lingual Chinese Word Sense Disambiguation	Nov 9, 2016	BenchmarkingTranslation	—Unverified	0
A Benchmark Dataset and Saliency-guided Stacked Autoencoders for Video-based Salient Object Detection	Nov 1, 2016	BenchmarkingObject	—Unverified	0
Word Embeddings for the Construction Domain	Oct 28, 2016	BenchmarkingGeneral Classification	CodeCode Available	0
Portfolio Benchmarking under Drawdown Constraint and Stochastic Sharpe Ratio	Oct 26, 2016	Benchmarking	—Unverified	0
Term-Class-Max-Support (TCMS): A Simple Text Document Categorization Approach Using Term-Class Relevance Measure	Oct 16, 2016	BenchmarkingText Categorization	—Unverified	0
There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction	Oct 7, 2016	BenchmarkingGrammatical Error Correction	CodeCode Available	0
Technical Report on the CleverHans v2.1.0 Adversarial Examples Library	Oct 3, 2016	Adversarial AttackAdversarial Defense	CodeCode Available	0
Estimating transmission from genetic and epidemiological data: a metric to compare transmission trees	Sep 28, 2016	Benchmarking	—Unverified	0

Show:10 25 50

← PrevPage 109 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified