Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4751–4800 of 5548 papers

Title	Date	Tasks	Status	Hype
EASTER: Efficient and Scalable Text Recognizer	Aug 18, 2020	BenchmarkingHandwritten Text Recognition	—Unverified	0
AIPerf: Automated machine learning as an AI-HPC benchmark	Aug 17, 2020	AutoMLBenchmarking	CodeCode Available	1
From Attack to Protection: Leveraging Watermarking Attack Network for Advanced Add-on Watermarking	Aug 14, 2020	Benchmarking	—Unverified	0
Continuous Optimization Benchmarks by Simulation	Aug 14, 2020	BenchmarkingGaussian Processes	CodeCode Available	0
An AI based talent acquisition and benchmarking for job	Aug 12, 2020	BenchmarkingCultural Vocal Bursts Intensity Prediction	—Unverified	0
Short-term origin-destination demand prediction in urban rail transit systems: A channel-wise attentive split-convolutional neural network method	Aug 8, 2020	BenchmarkingManagement	—Unverified	0
Scission: Performance-driven and Context-aware Cloud-Edge Distribution of Deep Neural Networks	Aug 8, 2020	BenchmarkingDecision Making	CodeCode Available	0
A critical analysis of metrics used for measuring progress in artificial intelligence	Aug 6, 2020	Benchmarking	—Unverified	0
Cross-Model Image Annotation Platform with Active Learning	Aug 6, 2020	Active LearningBenchmarking	—Unverified	0
Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms	Aug 1, 2020	BenchmarkingDeblurring	—Unverified	0
Beyond Monocular Deraining: Stereo Image Deraining via Semantic Understanding	Aug 1, 2020	BenchmarkingRain Removal	—Unverified	0
Robust Benchmarking for Machine Learning of Clinical Entity Extraction	Jul 31, 2020	BenchmarkingBIG-bench Machine Learning	CodeCode Available	0
Benchmarking and Comparing Multi-exposure Image Fusion Algorithms	Jul 30, 2020	BenchmarkingMulti-Exposure Image Fusion	—Unverified	0
Deep Hedging of Long-Term Financial Derivatives	Jul 29, 2020	BenchmarkingDeep Reinforcement Learning	—Unverified	0
dMelodies: A Music Dataset for Disentanglement Learning	Jul 29, 2020	BenchmarkingDisentanglement	CodeCode Available	1
Realistic Video Summarization through VISIOCITY: A New Benchmark and Evaluation Framework	Jul 29, 2020	BenchmarkingVideo Summarization	—Unverified	0
Benchmarking Meta-heuristic Optimization	Jul 27, 2020	BenchmarkingEvolutionary Algorithms	—Unverified	0
From Sound Representation to Model Robustness	Jul 27, 2020	Adversarial AttackAdversarial Robustness	—Unverified	0
Benchmarking Multivariate Time Series Classification Algorithms	Jul 26, 2020	BenchmarkingClassification	—Unverified	0
Image-Based Benchmarking and Visualization for Large-Scale Global Optimization	Jul 24, 2020	BenchmarkingDimensionality Reduction	—Unverified	0
A Survey on Performance Metrics for Object-Detection Algorithms	Jul 21, 2020	BenchmarkingObject	CodeCode Available	3
Explainable Rumor Detection using Inter and Intra-feature Attention Networks	Jul 21, 2020	Benchmarking	—Unverified	0
DDR-ID: Dual Deep Reconstruction Networks Based Image Decomposition for Anomaly Detection	Jul 18, 2020	Adversarial AttackAdversarial Attack Detection	—Unverified	0
Few-Shot Defect Segmentation Leveraging Abundant Normal Training Samples Through Normal Background Regularization and Crop-and-Paste Operation	Jul 18, 2020	Anomaly DetectionBenchmarking	—Unverified	0
ImageNet performance correlates with pose estimation robustness and generalization on out-of-domain data	Jul 17, 2020	Animal Pose EstimationBenchmarking	—Unverified	0
WordCraft: An Environment for Benchmarking Commonsense Agents	Jul 17, 2020	BenchmarkingKnowledge Graphs	CodeCode Available	1
Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation	Jul 17, 2020	BenchmarkingDisentanglement	CodeCode Available	0
Towards an Automated SOAP Note: Classifying Utterances from Medical Conversations	Jul 17, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
CoNES: Convex Natural Evolutionary Strategies	Jul 16, 2020	BenchmarkingMuJoCo	—Unverified	0
Are We There Yet? Evaluating State-of-the-Art Neural Network based Geoparsers Using EUPEG as a Benchmarking Platform	Jul 15, 2020	ArticlesBenchmarking	CodeCode Available	1
Emoji Prediction: Extensions and Benchmarking	Jul 14, 2020	BenchmarkingMulti-Label Classification	CodeCode Available	1
Towards causal benchmarking of bias in face analysis algorithms	Jul 13, 2020	AttributeBenchmarking	CodeCode Available	0
CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning Robustness	Jul 13, 2020	Benchmarking	CodeCode Available	1
Affine Non-negative Collaborative Representation Based Pattern Classification	Jul 10, 2020	BenchmarkingClassification	CodeCode Available	0
GAMA: a General Automated Machine learning Assistant	Jul 9, 2020	AutoMLBenchmarking	CodeCode Available	1
VisImages: A Fine-Grained Expert-Annotated Visualization Dataset	Jul 9, 2020	Benchmarking	—Unverified	0
Enhancing spatial and textual analysis with EUPEG: an extensible and unified platform for evaluating geoparsers	Jul 9, 2020	Benchmarking	CodeCode Available	1
URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural Networks	Jul 8, 2020	Bayesian InferenceBenchmarking	CodeCode Available	1
Quaternion Capsule Networks	Jul 8, 2020	BenchmarkingObject Recognition	CodeCode Available	0
RobFR: Benchmarking Adversarial Robustness on Face Recognition	Jul 8, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
IOHanalyzer: Detailed Performance Analyses for Iterative Optimization Heuristics	Jul 8, 2020	Bayesian OptimizationBenchmarking	CodeCode Available	1
Benchmarking in Optimization: Best Practice and Open Issues	Jul 7, 2020	Benchmarking	—Unverified	0
Re-thinking Co-Salient Object Detection	Jul 7, 2020	BenchmarkingCo-Salient Object Detection	CodeCode Available	1
Wiki-CS: A Wikipedia-Based Benchmark for Graph Neural Networks	Jul 6, 2020	ArticlesBenchmarking	CodeCode Available	1
Complex Human Action Recognition in Live Videos Using Hybrid FR-DL Method	Jul 6, 2020	Action RecognitionArticles	—Unverified	0
Does imputation matter? Benchmark for predictive models	Jul 6, 2020	BenchmarkingBIG-bench Machine Learning	—Unverified	0
Automatic Target Recognition on Synthetic Aperture Radar Imagery: A Survey	Jul 4, 2020	BenchmarkingSurvey	—Unverified	0
Building benchmarking frameworks for supporting replicability and reproducibility: spatial and textual analysis as an example	Jul 4, 2020	BenchmarkingPosition	—Unverified	0
Quo Vadis, Skeleton Action Recognition ?	Jul 4, 2020	Action RecognitionBenchmarking	CodeCode Available	1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient	Jul 3, 2020	BenchmarkingMuJoCo	CodeCode Available	1

Show:10 25 50

← PrevPage 96 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified