| CLMB: deep contrastive learning for robust metagenomic binning | Nov 18, 2021 | Benchmarking, Contrastive Learning | Code Available | 0 |
| Benchmarking and scaling of deep learning models for land cover image classification | Nov 18, 2021 | Benchmarking, Classification | Code Available | 1 |
| Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion Algorithms | Nov 17, 2021 | Benchmarking | Unverified | 0 |
| MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization | Nov 16, 2021 | Benchmarking, dialogue summary | Unverified | 0 |
| Fantastic Questions and Where to Find Them: FairytaleQA--An Authentic Dataset for Narrative Comprehension | Nov 16, 2021 | Benchmarking, Question Answering | Unverified | 0 |
| FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding | Nov 16, 2021 | Benchmarking, Natural Language Understanding | Unverified | 0 |
| Mukayese: Turkish NLP Strikes Back | Nov 16, 2021 | Benchmarking, Language Modeling | Unverified | 0 |
| CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms | Nov 16, 2021 | Benchmarking, Deep Reinforcement Learning | Code Available | 3 |
| Multiclass Optimal Classification Trees with SVM-splits | Nov 16, 2021 | Benchmarking, Classification | Unverified | 0 |
| Benchmarking deep generative models for diverse antibody sequence design | Nov 12, 2021 | Benchmarking, Diversity | Unverified | 0 |
| ADCB: An Alzheimer's disease benchmark for evaluating observational estimators of causal effects | Nov 12, 2021 | Benchmarking, Causal Inference | Unverified | 0 |
| Bi-Discriminator Class-Conditional Tabular GAN | Nov 12, 2021 | Benchmarking | Unverified | 0 |
| MLHarness: A Scalable Benchmarking System for MLCommons | Nov 9, 2021 | Benchmarking | Unverified | 0 |
| Which priors matter? Benchmarking models for learning latent dynamics | Nov 9, 2021 | Autonomous Driving, Benchmarking | Code Available | 1 |
| EvoLearner: Learning Description Logics with Evolutionary Algorithms | Nov 8, 2021 | Benchmarking, Evolutionary Algorithms | Code Available | 0 |
| Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object Localization | Nov 8, 2021 | 3D Feature Matching, Benchmarking | Unverified | 0 |
| Characterizing the adversarial vulnerability of speech self-supervised learning | Nov 8, 2021 | Adversarial Robustness, Benchmarking | Unverified | 0 |
| Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning | Nov 8, 2021 | Adversarial Robustness, Benchmarking | Code Available | 1 |
| Personalized Benchmarking with the Ludwig Benchmarking Toolkit | Nov 8, 2021 | Benchmarking, Hyperparameter Optimization | Code Available | 3 |
| IOHexperimenter: Benchmarking Platform for Iterative Optimization Heuristics | Nov 7, 2021 | Bayesian Optimization, Benchmarking | Code Available | 1 |
| Benchmarking Data-driven Surrogate Simulators for Artificial Electromagnetic Materials | Nov 6, 2021 | Benchmarking, Neural Network simulation | Code Available | 1 |
| A new baseline for retinal vessel segmentation: Numerical identification and correction of methodological inconsistencies affecting 100+ papers | Nov 6, 2021 | Benchmarking, Retinal Vessel Segmentation | Code Available | 0 |
| Benchmarking Multimodal AutoML for Tabular Data with Text Fields | Nov 4, 2021 | AutoML, Benchmarking | Code Available | 3 |
| B-Pref: Benchmarking Preference-Based Reinforcement Learning | Nov 4, 2021 | Benchmarking, reinforcement-learning | Code Available | 1 |
| OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic Full Waveform Inversion | Nov 4, 2021 | 2k, Benchmarking | Code Available | 1 |
| Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies | Nov 3, 2021 | All, Benchmarking | Unverified | 0 |
| Virus-MNIST: Machine Learning Baseline Calculations for Image Classification | Nov 3, 2021 | Benchmarking, BIG-bench Machine Learning | Unverified | 0 |
| Procedural Generalization by Planning with Self-Supervised World Models | Nov 2, 2021 | Benchmarking, Meta-Learning | Unverified | 0 |
| Don’t be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System | Nov 1, 2021 | Benchmarking, Response Generation | Code Available | 1 |
| Constructing a Psychometric Testbed for Fair Natural Language Processing | Nov 1, 2021 | Benchmarking, Fairness | Code Available | 0 |
| Benchmarking Meta-embeddings: What Works and What Does Not | Nov 1, 2021 | Benchmarking, Embeddings Evaluation | Code Available | 1 |
| Automatic Resolution of Domain Name Disputes | Nov 1, 2021 | Benchmarking | Code Available | 0 |
| Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains | Nov 1, 2021 | Benchmarking, Language Modeling | Code Available | 0 |
| OPF-Learn: An Open-Source Framework for Creating Representative AC Optimal Power Flow Datasets | Nov 1, 2021 | Benchmarking | Code Available | 1 |
| AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling | Nov 1, 2021 | Benchmarking, object-detection | Code Available | 1 |
| Livestock Monitoring with Transformer | Nov 1, 2021 | Action Recognition, Benchmarking | Unverified | 0 |
| Distributing Deep Learning Hyperparameter Tuning for 3D Medical Image Segmentation | Oct 29, 2021 | Benchmarking, Brain Tumor Segmentation | Code Available | 0 |
| Towards a Taxonomy of Graph Learning Datasets | Oct 27, 2021 | Benchmarking, Graph Learning | Unverified | 0 |
| FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation | Oct 26, 2021 | Benchmarking, Scene Segmentation | Code Available | 1 |
| Quantum Boosting using Domain-Partitioning Hypotheses | Oct 25, 2021 | Benchmarking, Ensemble Learning | Code Available | 0 |
| Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks | Oct 25, 2021 | Benchmarking, continuous-control | Code Available | 0 |
| Identifying and Benchmarking Natural Out-of-Context Prediction Problems | Oct 25, 2021 | Benchmarking | Code Available | 0 |
| Scientific Machine Learning Benchmarks | Oct 25, 2021 | Benchmarking, BIG-bench Machine Learning | Unverified | 0 |
| Benchmarking of Lightweight Deep Learning Architectures for Skin Cancer Classification using ISIC 2017 Dataset | Oct 23, 2021 | Benchmarking, Cancer Classification | Unverified | 0 |
| Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations | Oct 22, 2021 | Benchmarking, Learning with noisy labels | Code Available | 1 |
| MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems | Oct 21, 2021 | Benchmarking, BIG-bench Machine Learning | Unverified | 0 |
| OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis | Oct 21, 2021 | Benchmarking, BIG-bench Machine Learning | Code Available | 1 |
| Text-Based Person Search with Limited Data | Oct 20, 2021 | Benchmarking, Contrastive Learning | Code Available | 1 |
| Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction | Oct 20, 2021 | Benchmarking, Language Modeling | Code Available | 0 |
| An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C) | Oct 20, 2021 | Benchmarking | Unverified | 0 |