Benchmarking

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1351–1400 of 5548 papers

Title	Date	Tasks	Status	Hype
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis	Mar 9, 2021	BenchmarkingClassification	CodeCode Available	1
OpenICS: Open Image Compressive Sensing Toolbox and Benchmark	Feb 28, 2021	BenchmarkingCompressive Sensing	CodeCode Available	1
Benchmarking and Survey of Explanation Methods for Black Box Models	Feb 25, 2021	BenchmarkingSurvey	CodeCode Available	1
4D Panoptic LiDAR Segmentation	Feb 24, 2021	4D Panoptic SegmentationBenchmarking	CodeCode Available	1
Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking	Feb 19, 2021	BenchmarkingOpenAI Gym	CodeCode Available	1
NuCLS: A scalable crowdsourcing, deep learning approach and dataset for nucleus classification, localization and segmentation	Feb 18, 2021	BenchmarkingInterpretable Machine Learning	CodeCode Available	1
GraphGallery: A Platform for Fast Benchmarking and Easy Development of Graph Neural Networks Based Intelligent Software	Feb 16, 2021	Benchmarking	CodeCode Available	1
HAWKS: Evolving Challenging Benchmark Sets for Cluster Analysis	Feb 13, 2021	BenchmarkingClustering	CodeCode Available	1
Towards Large Scale Automated Algorithm Design by Integrating Modular Benchmarking Frameworks	Feb 12, 2021	Benchmarking	CodeCode Available	1
Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19	Feb 9, 2021	BenchmarkingQ-Learning	CodeCode Available	1
Benchmarking Quantized Neural Networks on FPGAs with FINN	Feb 2, 2021	BenchmarkingQuantization	CodeCode Available	1
Generating a Doppelganger Graph: Resembling but Distinct	Jan 23, 2021	BenchmarkingGraph Representation Learning	CodeCode Available	1
COSMOS: Catching Out-of-Context Misinformation with Self-Supervised Learning	Jan 15, 2021	BenchmarkingMisinformation	CodeCode Available	1
Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans	Jan 14, 2021	BenchmarkingMedical Diagnosis	CodeCode Available	1
Benchmarking Simulation-Based Inference	Jan 12, 2021	Benchmarking	CodeCode Available	1
Shallow-UWnet : Compressed Model for Underwater Image Enhancement	Jan 6, 2021	BenchmarkingImage Enhancement	CodeCode Available	1
Descending through a Crowded Valley — Benchmarking Deep Learning Optimizers	Jan 1, 2021	BenchmarkingDeep Learning	CodeCode Available	1
Rotation Equivariant Siamese Networks for Tracking	Dec 24, 2020	2D Pose EstimationBenchmarking	CodeCode Available	1
TACTO: A Fast, Flexible, and Open-source Simulator for High-Resolution Vision-based Tactile Sensors	Dec 15, 2020	Benchmarking	CodeCode Available	1
Evaluating Attribution for Graph Neural Networks	Dec 1, 2020	Benchmarking	CodeCode Available	1
PMLB v1.0: An open source dataset collection for benchmarking machine learning methods	Nov 30, 2020	BenchmarkingBIG-bench Machine Learning	CodeCode Available	1
Benchmarking Image Retrieval for Visual Localization	Nov 24, 2020	Autonomous DrivingBenchmarking	CodeCode Available	1
RobustPointSet: A Dataset for Benchmarking Robustness of Point Cloud Classifiers	Nov 23, 2020	3D Point Cloud ClassificationBenchmarking	CodeCode Available	1
Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking	Nov 15, 2020	Benchmarkingcontinuous-control	CodeCode Available	1
Real-Time Polyp Detection, Localization and Segmentation in Colonoscopy Using Deep Learning	Nov 15, 2020	BenchmarkingColorectal Polyps Characterization	CodeCode Available	1
SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation	Nov 14, 2020	BenchmarkingDeep Reinforcement Learning	CodeCode Available	1
tvopt: A Python Framework for Time-Varying Optimization	Nov 12, 2020	Benchmarking	CodeCode Available	1
Long Range Arena: A Benchmark for Efficient Transformers	Nov 8, 2020	16kBenchmarking	CodeCode Available	1
Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs	Nov 2, 2020	Benchmarking	CodeCode Available	1
Benchmarking Meaning Representations in Neural Semantic Parsing	Nov 1, 2020	BenchmarkingSemantic Parsing	CodeCode Available	1
A Critical Assessment of State-of-the-Art in Entity Alignment	Oct 30, 2020	BenchmarkingEntity Alignment	CodeCode Available	1
Benchmarking Deep Learning Interpretability in Time Series Predictions	Oct 26, 2020	BenchmarkingDeep Learning	CodeCode Available	1
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy	Oct 23, 2020	BenchmarkingDiagnostic	CodeCode Available	1
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi	Oct 23, 2020	ArticlesBenchmarking	CodeCode Available	1
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets	Oct 22, 2020	ArticlesBenchmarking	CodeCode Available	1
Self-Alignment Pretraining for Biomedical Entity Representations	Oct 22, 2020	BenchmarkingEntity Linking	CodeCode Available	1
German's Next Language Model	Oct 21, 2020	BenchmarkingDocument Classification	CodeCode Available	1
Promoting High Diversity Ensemble Learning with EnsembleBench	Oct 20, 2020	BenchmarkingDiversity	CodeCode Available	1
RobustBench: a standardized adversarial robustness benchmark	Oct 19, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather	Oct 18, 2020	Autonomous DrivingBenchmarking	CodeCode Available	1
Light Field Salient Object Detection: A Review and Benchmark	Oct 10, 2020	BenchmarkingObject	CodeCode Available	1
Olympus: a benchmarking framework for noisy optimization and experiment planning	Oct 8, 2020	BenchmarkingProbabilistic Deep Learning	CodeCode Available	1
OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets	Oct 2, 2020	BenchmarkingPrediction	CodeCode Available	1
Bag of Tricks for Adversarial Training	Oct 1, 2020	Adversarial RobustnessBenchmarking	CodeCode Available	1
HINT3: Raising the bar for Intent Detection in the Wild	Sep 29, 2020	BenchmarkingIntent Detection	CodeCode Available	1
Benchmarking deep inverse models over time, and the neural-adjoint method	Sep 27, 2020	Benchmarking	CodeCode Available	1
A BFS-Tree of Ranking References for Unsupervised Manifold Learning	Sep 24, 2020	BenchmarkingImage Retrieval	CodeCode Available	1
CoDEx: A Comprehensive Knowledge Graph Completion Benchmark	Sep 16, 2020	BenchmarkingKnowledge Graph Completion	CodeCode Available	1
BARS-CTR: Open Benchmarking for Click-Through Rate Prediction	Sep 12, 2020	BenchmarkingClick-Through Rate Prediction	CodeCode Available	1
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding	Sep 11, 2020	BenchmarkingDiversity	CodeCode Available	1

Show:10 25 50

← PrevPage 28 of 111Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified