Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1401–1450 of 5548 papers

Title	Date	Tasks	Status	Hype
PT-Ranking: A Benchmarking Platform for Neural Learning-to-Rank	Aug 31, 2020	BenchmarkingLearning-To-Rank	CodeCode Available	1
NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size	Aug 28, 2020	BenchmarkingDiagnostic	CodeCode Available	1
Image Colorization: A Survey and Dataset	Aug 25, 2020	BenchmarkingColorization	CodeCode Available	1
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory	Aug 24, 2020	Benchmarking	CodeCode Available	1
Quantitative Survey of the State of the Art in Sign Language Recognition	Aug 22, 2020	BenchmarkingSign Language Recognition	CodeCode Available	1
Automatic sleep stage classification with deep residual networks in a mixed-cohort setting	Aug 21, 2020	Automatic Sleep Stage ClassificationBenchmarking	CodeCode Available	1
ISSAFE: Improving Semantic Segmentation in Accidents by Fusing Event-based Data	Aug 20, 2020	Autonomous VehiclesBenchmarking	CodeCode Available	1
AIPerf: Automated machine learning as an AI-HPC benchmark	Aug 17, 2020	AutoMLBenchmarking	CodeCode Available	1
dMelodies: A Music Dataset for Disentanglement Learning	Jul 29, 2020	BenchmarkingDisentanglement	CodeCode Available	1
WordCraft: An Environment for Benchmarking Commonsense Agents	Jul 17, 2020	BenchmarkingKnowledge Graphs	CodeCode Available	1
Are We There Yet? Evaluating State-of-the-Art Neural Network based Geoparsers Using EUPEG as a Benchmarking Platform	Jul 15, 2020	ArticlesBenchmarking	CodeCode Available	1
Emoji Prediction: Extensions and Benchmarking	Jul 14, 2020	BenchmarkingMulti-Label Classification	CodeCode Available	1
CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning Robustness	Jul 13, 2020	Benchmarking	CodeCode Available	1
GAMA: a General Automated Machine learning Assistant	Jul 9, 2020	AutoMLBenchmarking	CodeCode Available	1
Enhancing spatial and textual analysis with EUPEG: an extensible and unified platform for evaluating geoparsers	Jul 9, 2020	Benchmarking	CodeCode Available	1
RobFR: Benchmarking Adversarial Robustness on Face Recognition	Jul 8, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural Networks	Jul 8, 2020	Bayesian InferenceBenchmarking	CodeCode Available	1
IOHanalyzer: Detailed Performance Analyses for Iterative Optimization Heuristics	Jul 8, 2020	Bayesian OptimizationBenchmarking	CodeCode Available	1
Re-thinking Co-Salient Object Detection	Jul 7, 2020	BenchmarkingCo-Salient Object Detection	CodeCode Available	1
Wiki-CS: A Wikipedia-Based Benchmark for Graph Neural Networks	Jul 6, 2020	ArticlesBenchmarking	CodeCode Available	1
Quo Vadis, Skeleton Action Recognition ?	Jul 4, 2020	Action RecognitionBenchmarking	CodeCode Available	1
Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers	Jul 3, 2020	BenchmarkingDeep Learning	CodeCode Available	1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient	Jul 3, 2020	BenchmarkingMuJoCo	CodeCode Available	1
EndoSLAM Dataset and An Unsupervised Monocular Visual Odometry and Depth Estimation Approach for Endoscopic Videos: Endo-SfMLearner	Jun 30, 2020	BenchmarkingDepth Estimation	CodeCode Available	1
Labelling unlabelled videos from scratch with multi-modal self-supervision	Jun 24, 2020	BenchmarkingClustering	CodeCode Available	1
Monash University, UEA, UCR Time Series Extrinsic Regression Archive	Jun 19, 2020	BenchmarkingMissing Values	CodeCode Available	1
Mitigating Gender Bias in Captioning Systems	Jun 15, 2020	BenchmarkingGender Prediction	CodeCode Available	1
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks	Jun 14, 2020	BenchmarkingDeep Reinforcement Learning	CodeCode Available	1
Benchmarking Unsupervised Object Representations for Video Sequences	Jun 12, 2020	BenchmarkingClustering	CodeCode Available	1
Supervised learning is an accurate method for network-based gene classification	Jun 1, 2020	BenchmarkingGeneral Classification	CodeCode Available	1
Benchmarking Adversarial Robustness on Image Classification	Jun 1, 2020	Adversarial AttackAdversarial Robustness	CodeCode Available	1
Taking a Deeper Look at Co-Salient Object Detection	Jun 1, 2020	BenchmarkingCo-Salient Object Detection	CodeCode Available	1
UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content	May 29, 2020	Benchmarkingfeature selection	CodeCode Available	1
Reference Pose Generation for Long-term Visual Localization via Learned Features and View Synthesis	May 11, 2020	Autonomous DrivingBenchmarking	CodeCode Available	1
Curious Hierarchical Actor-Critic Reinforcement Learning	May 7, 2020	BenchmarkingHierarchical Reinforcement Learning	CodeCode Available	1
A Ladder of Causal Distances	May 5, 2020	BenchmarkingCausal Discovery	CodeCode Available	1
NTIRE 2020 Challenge on Real-World Image Super-Resolution: Methods and Results	May 5, 2020	BenchmarkingImage Super-Resolution	CodeCode Available	1
Introducing the VoicePrivacy Initiative	May 4, 2020	Benchmarking	CodeCode Available	1
Benchmarking Multidomain English-Indonesian Machine Translation	May 1, 2020	BenchmarkingMachine Translation	CodeCode Available	1
Benchmarking Robustness of Machine Reading Comprehension Models	Apr 29, 2020	BenchmarkingMachine Reading Comprehension	CodeCode Available	1
Machine Learning Methods for Brain Network Classification: Application to Autism Diagnosis using Cortical Morphological Networks	Apr 28, 2020	BenchmarkingBIG-bench Machine Learning	CodeCode Available	1
MAVEN: A Massive General Domain Event Detection Dataset	Apr 28, 2020	BenchmarkingEvent Detection	CodeCode Available	1
Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL	Apr 28, 2020	AllBenchmarking	CodeCode Available	1
A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced Cardiac Magnetic Resonance Imaging	Apr 26, 2020	BenchmarkingLeft Atrium Segmentation	CodeCode Available	1
Global Wheat Head Detection (GWHD) dataset: a large and diverse dataset of high resolution RGB labelled images to develop and benchmark wheat head detection methods	Apr 25, 2020	BenchmarkingHead Detection	CodeCode Available	1
New Protocols and Negative Results for Textual Entailment Data Collection	Apr 24, 2020	BenchmarkingDiversity	CodeCode Available	1
Shortcut Learning in Deep Neural Networks	Apr 16, 2020	Benchmarking	CodeCode Available	1
Evaluating Multimodal Representations on Visual Semantic Textual Similarity	Apr 4, 2020	BenchmarkingImage Captioning	CodeCode Available	1
Benchmarking End-to-End Behavioural Cloning on Video Games	Apr 2, 2020	Behavioural cloningBenchmarking	CodeCode Available	1
Event Probability Mask (EPM) and Event Denoising Convolutional Neural Network (EDnCNN) for Neuromorphic Cameras	Mar 18, 2020	BenchmarkingDenoising	CodeCode Available	1

Show:10 25 50

← PrevPage 29 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified