Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4701–4750 of 5548 papers

Title	Date	Tasks	Status	Hype
Olympus: a benchmarking framework for noisy optimization and experiment planning	Oct 8, 2020	BenchmarkingProbabilistic Deep Learning	CodeCode Available	1
The FaceChannelS: Strike of the Sequences for the AffWild 2 Challenge	Oct 4, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0
An Analysis of Control Parameters of MOEA/D Under Two Different Optimization Scenarios	Oct 2, 2020	BenchmarkingEvolutionary Algorithms	—Unverified	0
Reviewing and Benchmarking Parameter Control Methods in Differential Evolution	Oct 2, 2020	Benchmarking	—Unverified	0
OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets	Oct 2, 2020	BenchmarkingPrediction	CodeCode Available	1
A new dataset of dog breed images and a benchmark for fine-grained classification	Oct 1, 2020	BenchmarkingClassification	—Unverified	0
Bag of Tricks for Adversarial Training	Oct 1, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
Metrics for Benchmarking and Uncertainty Quantification: Quality, Applicability, and a Path to Best Practices for Machine Learning in Chemistry	Sep 30, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0
HINT3: Raising the bar for Intent Detection in the Wild	Sep 29, 2020	BenchmarkingIntent Detection	CodeCode Available	1
Graph Joint Attention Networks	Sep 28, 2020	BenchmarkingGraph Attention	—Unverified	0
An Analysis of Quality Indicators Using Approximated Optimal Distributions in a Three-dimensional Objective Space	Sep 27, 2020	Benchmarking	—Unverified	0
Benchmarking deep inverse models over time, and the neural-adjoint method	Sep 27, 2020	Benchmarking	CodeCode Available	1
A BFS-Tree of Ranking References for Unsupervised Manifold Learning	Sep 24, 2020	BenchmarkingImage Retrieval	CodeCode Available	1
Using Neural Architecture Search for Improving Software Flaw Detection in Multimodal Deep Learning Models	Sep 22, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Measuring the Complexity of Domains Used to Evaluate AI Systems	Sep 18, 2020	Benchmarking	—Unverified	0
What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus	Sep 17, 2020	BenchmarkingTerm Extraction	—Unverified	0
Job2Vec: Job Title Benchmarking with Collective Multi-View Representation Learning	Sep 16, 2020	BenchmarkingLink Prediction	—Unverified	0
NABU - Multilingual Graph-based Neural RDF Verbalizer	Sep 16, 2020	BenchmarkingDecoder	—Unverified	0
TadGAN: Time Series Anomaly Detection Using Generative Adversarial Networks	Sep 16, 2020	Anomaly DetectionBenchmarking	CodeCode Available	2
CoDEx: A Comprehensive Knowledge Graph Completion Benchmark	Sep 16, 2020	BenchmarkingKnowledge Graph Completion	CodeCode Available	1
CVPR 2020 Continual Learning in Computer Vision Competition: Approaches, Results, Current Challenges and Future Directions	Sep 14, 2020	BenchmarkingContinual Learning	CodeCode Available	0
A Multisensory Learning Architecture for Rotation-invariant Object Recognition	Sep 14, 2020	BenchmarkingObject	—Unverified	0
Utility-Optimized Synthesis of Differentially Private Location Traces	Sep 14, 2020	Bayesian OptimizationBenchmarking	—Unverified	0
BARS-CTR: Open Benchmarking for Click-Through Rate Prediction	Sep 12, 2020	BenchmarkingClick-Through Rate Prediction	CodeCode Available	1
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding	Sep 11, 2020	BenchmarkingDiversity	CodeCode Available	1
Optimal Eco-driving Control of Autonomous and Electric Trucks in Adaptation to Highway Topography: Energy Minimization and Battery Life Extension	Sep 10, 2020	BenchmarkingModel Predictive Control	—Unverified	0
MedMeshCNN -- Enabling MeshCNN for Medical Surface Models	Sep 10, 2020	BenchmarkingSegmentation	—Unverified	0
Searching for a Search Method: Benchmarking Search Algorithms for Generating NLP Adversarial Examples	Sep 9, 2020	Adversarial TextBenchmarking	CodeCode Available	2
Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents	Sep 9, 2020	Benchmarking	—Unverified	0
Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding	Sep 9, 2020	BenchmarkingClustering	CodeCode Available	0
Referenced Thermodynamic Integration for Bayesian Model Selection: Application to COVID-19 Model Selection	Sep 8, 2020	BenchmarkingEpidemiology	CodeCode Available	0
Benchmarking off-the-shelf statistical shape modeling tools in clinical applications	Sep 7, 2020	Benchmarking	—Unverified	0
Iris Liveness Detection Competition (LivDet-Iris) -- The 2020 Edition	Sep 1, 2020	Benchmarking	—Unverified	0
PT-Ranking: A Benchmarking Platform for Neural Learning-to-Rank	Aug 31, 2020	BenchmarkingLearning-To-Rank	CodeCode Available	1
Benchmarking adversarial attacks and defenses for time-series data	Aug 30, 2020	Adversarial DefenseBenchmarking	—Unverified	0
NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size	Aug 28, 2020	BenchmarkingDiagnostic	CodeCode Available	1
Adversarially Training for Audio Classifiers	Aug 26, 2020	Benchmarking	—Unverified	0
Image Colorization: A Survey and Dataset	Aug 25, 2020	BenchmarkingColorization	CodeCode Available	1
Optimal Scheduling of Anticipated COVID-19 Vaccination: A Case Study of New York State	Aug 24, 2020	BenchmarkingScheduling	—Unverified	0
HoloGen: An open source toolbox for high-speed hologram generation	Aug 24, 2020	3D HolographyBenchmarking	—Unverified	0
ScrewNet: Category-Independent Articulation Model Estimation From Depth Images Using Screw Theory	Aug 24, 2020	Benchmarking	CodeCode Available	1
Robust Vision Challenge 2020 -- 1st Place Report for Panoptic Segmentation	Aug 23, 2020	BenchmarkingPanoptic Segmentation	—Unverified	0
Holistic Multi-View Building Analysis in the Wild with Projection Pooling	Aug 23, 2020	Benchmarking	—Unverified	0
Quantitative Survey of the State of the Art in Sign Language Recognition	Aug 22, 2020	BenchmarkingSign Language Recognition	CodeCode Available	1
A Unified Taylor Framework for Revisiting Attribution Methods	Aug 21, 2020	BenchmarkingDecision Making	—Unverified	0
Automatic sleep stage classification with deep residual networks in a mixed-cohort setting	Aug 21, 2020	Automatic Sleep Stage ClassificationBenchmarking	CodeCode Available	1
MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark	Aug 21, 2020	BenchmarkingSemantic Parsing	—Unverified	0
ISSAFE: Improving Semantic Segmentation in Accidents by Fusing Event-based Data	Aug 20, 2020	Autonomous VehiclesBenchmarking	CodeCode Available	1
Benchmarking network fabrics for data distributed training of deep neural networks	Aug 18, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0
mlr3proba: An R Package for Machine Learning in Survival Analysis	Aug 18, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0

Show:10 25 50

← PrevPage 95 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified