| ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis | Mar 9, 2021 | BenchmarkingClassification | CodeCode Available | 1 |
| OpenICS: Open Image Compressive Sensing Toolbox and Benchmark | Feb 28, 2021 | BenchmarkingCompressive Sensing | CodeCode Available | 1 |
| Benchmarking and Survey of Explanation Methods for Black Box Models | Feb 25, 2021 | BenchmarkingSurvey | CodeCode Available | 1 |
| 4D Panoptic LiDAR Segmentation | Feb 24, 2021 | 4D Panoptic SegmentationBenchmarking | CodeCode Available | 1 |
| Deluca -- A Differentiable Control Library: Environments, Methods, and Benchmarking | Feb 19, 2021 | BenchmarkingOpenAI Gym | CodeCode Available | 1 |
| NuCLS: A scalable crowdsourcing, deep learning approach and dataset for nucleus classification, localization and segmentation | Feb 18, 2021 | BenchmarkingInterpretable Machine Learning | CodeCode Available | 1 |
| GraphGallery: A Platform for Fast Benchmarking and Easy Development of Graph Neural Networks Based Intelligent Software | Feb 16, 2021 | Benchmarking | CodeCode Available | 1 |
| HAWKS: Evolving Challenging Benchmark Sets for Cluster Analysis | Feb 13, 2021 | BenchmarkingClustering | CodeCode Available | 1 |
| Towards Large Scale Automated Algorithm Design by Integrating Modular Benchmarking Frameworks | Feb 12, 2021 | Benchmarking | CodeCode Available | 1 |
| Benchmarking Deep Graph Generative Models for Optimizing New Drug Molecules for COVID-19 | Feb 9, 2021 | BenchmarkingQ-Learning | CodeCode Available | 1 |
| Benchmarking Quantized Neural Networks on FPGAs with FINN | Feb 2, 2021 | BenchmarkingQuantization | CodeCode Available | 1 |
| Generating a Doppelganger Graph: Resembling but Distinct | Jan 23, 2021 | BenchmarkingGraph Representation Learning | CodeCode Available | 1 |
| COSMOS: Catching Out-of-Context Misinformation with Self-Supervised Learning | Jan 15, 2021 | BenchmarkingMisinformation | CodeCode Available | 1 |
| Automated Model Design and Benchmarking of 3D Deep Learning Models for COVID-19 Detection with Chest CT Scans | Jan 14, 2021 | BenchmarkingMedical Diagnosis | CodeCode Available | 1 |
| Benchmarking Simulation-Based Inference | Jan 12, 2021 | Benchmarking | CodeCode Available | 1 |
| Shallow-UWnet : Compressed Model for Underwater Image Enhancement | Jan 6, 2021 | BenchmarkingImage Enhancement | CodeCode Available | 1 |
| Descending through a Crowded Valley — Benchmarking Deep Learning Optimizers | Jan 1, 2021 | BenchmarkingDeep Learning | CodeCode Available | 1 |
| Rotation Equivariant Siamese Networks for Tracking | Dec 24, 2020 | 2D Pose EstimationBenchmarking | CodeCode Available | 1 |
| TACTO: A Fast, Flexible, and Open-source Simulator for High-Resolution Vision-based Tactile Sensors | Dec 15, 2020 | Benchmarking | CodeCode Available | 1 |
| Evaluating Attribution for Graph Neural Networks | Dec 1, 2020 | Benchmarking | CodeCode Available | 1 |
| PMLB v1.0: An open source dataset collection for benchmarking machine learning methods | Nov 30, 2020 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| Benchmarking Image Retrieval for Visual Localization | Nov 24, 2020 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| RobustPointSet: A Dataset for Benchmarking Robustness of Point Cloud Classifiers | Nov 23, 2020 | 3D Point Cloud ClassificationBenchmarking | CodeCode Available | 1 |
| Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking | Nov 15, 2020 | Benchmarkingcontinuous-control | CodeCode Available | 1 |
| Real-Time Polyp Detection, Localization and Segmentation in Colonoscopy Using Deep Learning | Nov 15, 2020 | BenchmarkingColorectal Polyps Characterization | CodeCode Available | 1 |
| SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation | Nov 14, 2020 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| tvopt: A Python Framework for Time-Varying Optimization | Nov 12, 2020 | Benchmarking | CodeCode Available | 1 |
| Long Range Arena: A Benchmark for Efficient Transformers | Nov 8, 2020 | 16kBenchmarking | CodeCode Available | 1 |
| Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs | Nov 2, 2020 | Benchmarking | CodeCode Available | 1 |
| Benchmarking Meaning Representations in Neural Semantic Parsing | Nov 1, 2020 | BenchmarkingSemantic Parsing | CodeCode Available | 1 |
| A Critical Assessment of State-of-the-Art in Entity Alignment | Oct 30, 2020 | BenchmarkingEntity Alignment | CodeCode Available | 1 |
| Benchmarking Deep Learning Interpretability in Time Series Predictions | Oct 26, 2020 | BenchmarkingDeep Learning | CodeCode Available | 1 |
| Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy | Oct 23, 2020 | BenchmarkingDiagnostic | CodeCode Available | 1 |
| KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi | Oct 23, 2020 | ArticlesBenchmarking | CodeCode Available | 1 |
| Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets | Oct 22, 2020 | ArticlesBenchmarking | CodeCode Available | 1 |
| Self-Alignment Pretraining for Biomedical Entity Representations | Oct 22, 2020 | BenchmarkingEntity Linking | CodeCode Available | 1 |
| German's Next Language Model | Oct 21, 2020 | BenchmarkingDocument Classification | CodeCode Available | 1 |
| Promoting High Diversity Ensemble Learning with EnsembleBench | Oct 20, 2020 | BenchmarkingDiversity | CodeCode Available | 1 |
| RobustBench: a standardized adversarial robustness benchmark | Oct 19, 2020 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| RADIATE: A Radar Dataset for Automotive Perception in Bad Weather | Oct 18, 2020 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Light Field Salient Object Detection: A Review and Benchmark | Oct 10, 2020 | BenchmarkingObject | CodeCode Available | 1 |
| Olympus: a benchmarking framework for noisy optimization and experiment planning | Oct 8, 2020 | BenchmarkingProbabilistic Deep Learning | CodeCode Available | 1 |
| OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets | Oct 2, 2020 | BenchmarkingPrediction | CodeCode Available | 1 |
| Bag of Tricks for Adversarial Training | Oct 1, 2020 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| HINT3: Raising the bar for Intent Detection in the Wild | Sep 29, 2020 | BenchmarkingIntent Detection | CodeCode Available | 1 |
| Benchmarking deep inverse models over time, and the neural-adjoint method | Sep 27, 2020 | Benchmarking | CodeCode Available | 1 |
| A BFS-Tree of Ranking References for Unsupervised Manifold Learning | Sep 24, 2020 | BenchmarkingImage Retrieval | CodeCode Available | 1 |
| CoDEx: A Comprehensive Knowledge Graph Completion Benchmark | Sep 16, 2020 | BenchmarkingKnowledge Graph Completion | CodeCode Available | 1 |
| BARS-CTR: Open Benchmarking for Click-Through Rate Prediction | Sep 12, 2020 | BenchmarkingClick-Through Rate Prediction | CodeCode Available | 1 |
| IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding | Sep 11, 2020 | BenchmarkingDiversity | CodeCode Available | 1 |