| Which Model to Trust: Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms for Continuous Control Tasks | Oct 25, 2021 | Benchmarkingcontinuous-control | CodeCode Available | 0 |
| Identifying and Benchmarking Natural Out-of-Context Prediction Problems | Oct 25, 2021 | Benchmarking | CodeCode Available | 0 |
| Scientific Machine Learning Benchmarks | Oct 25, 2021 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Benchmarking of Lightweight Deep Learning Architectures for Skin Cancer Classification using ISIC 2017 Dataset | Oct 23, 2021 | BenchmarkingCancer Classification | —Unverified | 0 |
| Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations | Oct 22, 2021 | BenchmarkingLearning with noisy labels | CodeCode Available | 1 |
| MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems | Oct 21, 2021 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis | Oct 21, 2021 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| Text-Based Person Search with Limited Data | Oct 20, 2021 | BenchmarkingContrastive Learning | CodeCode Available | 1 |
| Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction | Oct 20, 2021 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C) | Oct 20, 2021 | Benchmarking | —Unverified | 0 |