Benchmarking

Papers

Title	Date	Tasks	Status	Hype
WordCraft: An Environment for Benchmarking Commonsense Agents	Jul 17, 2020	Benchmarking, Knowledge Graphs	Code Available	1
Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation	Jul 17, 2020	Benchmarking, Disentanglement	Code Available	0
Towards an Automated SOAP Note: Classifying Utterances from Medical Conversations	Jul 17, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
CoNES: Convex Natural Evolutionary Strategies	Jul 16, 2020	Benchmarking, MuJoCo	Unverified	0
Are We There Yet? Evaluating State-of-the-Art Neural Network based Geoparsers Using EUPEG as a Benchmarking Platform	Jul 15, 2020	Articles, Benchmarking	Code Available	1
Emoji Prediction: Extensions and Benchmarking	Jul 14, 2020	Benchmarking, Multi-Label Classification	Code Available	1
Towards causal benchmarking of bias in face analysis algorithms	Jul 13, 2020	Attribute, Benchmarking	Code Available	0
CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning Robustness	Jul 13, 2020	Benchmarking	Code Available	1
Affine Non-negative Collaborative Representation Based Pattern Classification	Jul 10, 2020	Benchmarking, Classification	Code Available	0
GAMA: a General Automated Machine learning Assistant	Jul 9, 2020	AutoML, Benchmarking	Code Available	1
VisImages: A Fine-Grained Expert-Annotated Visualization Dataset	Jul 9, 2020	Benchmarking	Unverified	0
Enhancing spatial and textual analysis with EUPEG: an extensible and unified platform for evaluating geoparsers	Jul 9, 2020	Benchmarking	Code Available	1
URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural Networks	Jul 8, 2020	Bayesian Inference, Benchmarking	Code Available	1
Quaternion Capsule Networks	Jul 8, 2020	Benchmarking, Object Recognition	Code Available	0
RobFR: Benchmarking Adversarial Robustness on Face Recognition	Jul 8, 2020	Adversarial Robustness, Benchmarking	Code Available	1
IOHanalyzer: Detailed Performance Analyses for Iterative Optimization Heuristics	Jul 8, 2020	Bayesian Optimization, Benchmarking	Code Available	1
Benchmarking in Optimization: Best Practice and Open Issues	Jul 7, 2020	Benchmarking	Unverified	0
Re-thinking Co-Salient Object Detection	Jul 7, 2020	Benchmarking, Co-Salient Object Detection	Code Available	1
Wiki-CS: A Wikipedia-Based Benchmark for Graph Neural Networks	Jul 6, 2020	Articles, Benchmarking	Code Available	1
Complex Human Action Recognition in Live Videos Using Hybrid FR-DL Method	Jul 6, 2020	Action Recognition, Articles	Unverified	0
Does imputation matter? Benchmark for predictive models	Jul 6, 2020	Benchmarking, BIG-bench Machine Learning	Unverified	0
Automatic Target Recognition on Synthetic Aperture Radar Imagery: A Survey	Jul 4, 2020	Benchmarking, Survey	Unverified	0
Building benchmarking frameworks for supporting replicability and reproducibility: spatial and textual analysis as an example	Jul 4, 2020	Benchmarking, Position	Unverified	0
Quo Vadis, Skeleton Action Recognition ?	Jul 4, 2020	Action Recognition, Benchmarking	Code Available	1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient	Jul 3, 2020	Benchmarking, MuJoCo	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified