| When Do Flat Minima Optimizers Work? | Feb 1, 2022 | BenchmarkingGraph Learning | CodeCode Available | 1 |
| Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study | Dec 30, 2021 | AttributeBenchmarking | CodeCode Available | 1 |
| Are we really making much progress? Revisiting, benchmarking, and refining heterogeneous graph neural networks | Dec 30, 2021 | BenchmarkingHeterogeneous Node Classification | CodeCode Available | 1 |
| Leveraging Trust for Joint Multi-Objective and Multi-Fidelity Optimization | Dec 27, 2021 | Bayesian OptimizationBenchmarking | CodeCode Available | 1 |
| Autonomous Reinforcement Learning: Formalism and Benchmarking | Dec 17, 2021 | Benchmarkingreinforcement-learning | CodeCode Available | 1 |
| High-Dimensional Inference in Bayesian Networks | Dec 16, 2021 | BenchmarkingVocal Bursts Intensity Prediction | CodeCode Available | 1 |
| Boosting Neural Image Compression for Machines Using Latent Space Masking | Dec 15, 2021 | BenchmarkingImage Compression | CodeCode Available | 1 |
| Label, Verify, Correct: A Simple Few Shot Object Detection Method | Dec 10, 2021 | BenchmarkingFew-Shot Object Detection | CodeCode Available | 1 |
| Learning Representations with Contrastive Self-Supervised Learning for Histopathology Applications | Dec 10, 2021 | BenchmarkingContrastive Learning | CodeCode Available | 1 |
| Benchmarking human visual search computational models in natural scenes: models comparison and reference datasets | Dec 10, 2021 | Benchmarking | CodeCode Available | 1 |
| Object Shape Error Response Using Bayesian 3-D Convolutional Neural Networks for Assembly Systems With Compliant Parts | Dec 8, 2021 | 3D Shape ModelingBenchmarking | CodeCode Available | 1 |
| Neuro-Symbolic Inductive Logic Programming with Logical Neural Networks | Dec 6, 2021 | BenchmarkingInductive logic programming | CodeCode Available | 1 |
| HyFactor: Hydrogen-count labelled graph-based defactorization Autoencoder | Dec 6, 2021 | BenchmarkingGraph Learning | CodeCode Available | 1 |
| BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale | Dec 4, 2021 | BenchmarkingHyperparameter Optimization | CodeCode Available | 1 |
| CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer | Dec 2, 2021 | BenchmarkingOrdinal Classification | CodeCode Available | 1 |
| TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation | Dec 2, 2021 | BenchmarkingImage Generation | CodeCode Available | 1 |
| Neural Regression, Representational Similarity, Model Zoology & Neural Taskonomy at Scale in Rodent Visual Cortex | Dec 1, 2021 | BenchmarkingObject Recognition | CodeCode Available | 1 |
| MC-Blur: A Comprehensive Benchmark for Image Deblurring | Dec 1, 2021 | BenchmarkingDeblurring | CodeCode Available | 1 |
| NEORL: NeuroEvolution Optimization with Reinforcement Learning | Dec 1, 2021 | Benchmarkingglobal-optimization | CodeCode Available | 1 |
| ClimART: A Benchmark Dataset for Emulating Atmospheric Radiative Transfer in Weather and Climate Models | Nov 29, 2021 | BenchmarkingPhysical Simulations | CodeCode Available | 1 |
| Benchmarking Accuracy and Generalizability of Four Graph Neural Networks Using Large In Vitro ADME Datasets from Different Chemical Spaces | Nov 27, 2021 | BenchmarkingGraph Attention | CodeCode Available | 1 |
| EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search | Nov 24, 2021 | BenchmarkingNeural Architecture Search | CodeCode Available | 1 |
| Benchmarking Detection Transfer Learning with Vision Transformers | Nov 22, 2021 | Benchmarkingobject-detection | CodeCode Available | 1 |
| Evaluating Adversarial Attacks on ImageNet: A Reality Check on Misclassification Classes | Nov 22, 2021 | Benchmarking | CodeCode Available | 1 |
| Benchmarking emergency department triage prediction models with machine learning and large public electronic health records | Nov 22, 2021 | Benchmarking | CodeCode Available | 1 |
| FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Nov 22, 2021 | BenchmarkingFederated Learning | CodeCode Available | 1 |
| GRecX: An Efficient and Unified Benchmark for GNN-based Recommendation | Nov 19, 2021 | BenchmarkingManagement | CodeCode Available | 1 |
| Benchmarking and scaling of deep learning models for land cover image classification | Nov 18, 2021 | BenchmarkingClassification | CodeCode Available | 1 |
| Which priors matter? Benchmarking models for learning latent dynamics | Nov 9, 2021 | Autonomous DrivingBenchmarking | CodeCode Available | 1 |
| Graph Robustness Benchmark: Benchmarking the Adversarial Robustness of Graph Machine Learning | Nov 8, 2021 | Adversarial RobustnessBenchmarking | CodeCode Available | 1 |
| IOHexperimenter: Benchmarking Platform for Iterative Optimization Heuristics | Nov 7, 2021 | Bayesian OptimizationBenchmarking | CodeCode Available | 1 |
| Benchmarking Data-driven Surrogate Simulators for Artificial Electromagnetic Materials | Nov 6, 2021 | BenchmarkingNeural Network simulation | CodeCode Available | 1 |
| OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic Full Waveform Inversion | Nov 4, 2021 | 2kBenchmarking | CodeCode Available | 1 |
| B-Pref: Benchmarking Preference-Based Reinforcement Learning | Nov 4, 2021 | Benchmarkingreinforcement-learning | CodeCode Available | 1 |
| AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling | Nov 1, 2021 | Benchmarkingobject-detection | CodeCode Available | 1 |
| OPF-Learn: An Open-Source Framework for Creating Representative AC Optimal Power Flow Datasets | Nov 1, 2021 | Benchmarking | CodeCode Available | 1 |
| Don’t be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System | Nov 1, 2021 | BenchmarkingResponse Generation | CodeCode Available | 1 |
| Benchmarking Meta-embeddings: What Works and What Does Not | Nov 1, 2021 | BenchmarkingEmbeddings Evaluation | CodeCode Available | 1 |
| FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation | Oct 26, 2021 | BenchmarkingScene Segmentation | CodeCode Available | 1 |
| Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations | Oct 22, 2021 | BenchmarkingLearning with noisy labels | CodeCode Available | 1 |
| OpenABC-D: A Large-Scale Dataset For Machine Learning Guided Integrated Circuit Synthesis | Oct 21, 2021 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 1 |
| Text-Based Person Search with Limited Data | Oct 20, 2021 | BenchmarkingContrastive Learning | CodeCode Available | 1 |
| NAS-HPO-Bench-II: A Benchmark Dataset on Joint Optimization of Convolutional Neural Network Architecture and Training Hyperparameters | Oct 19, 2021 | 4kBenchmarking | CodeCode Available | 1 |
| HUMAN4D: A Human-Centric Multimodal Dataset for Motions and Immersive Media | Oct 14, 2021 | 3D Pose EstimationBenchmarking | CodeCode Available | 1 |
| Benchmarking the Robustness of Spatial-Temporal Models Against Corruptions | Oct 13, 2021 | BenchmarkingComputational Efficiency | CodeCode Available | 1 |
| Codabench: Flexible, Easy-to-Use and Reproducible Benchmarking Platform | Oct 12, 2021 | Benchmarking | CodeCode Available | 1 |
| NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks | Oct 12, 2021 | Benchmarkingimage-classification | CodeCode Available | 1 |
| S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations | Oct 12, 2021 | BenchmarkingVoice Conversion | CodeCode Available | 1 |
| EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Oct 11, 2021 | BenchmarkingFace Hallucination | CodeCode Available | 1 |
| Performance Evaluation of Deep Transfer Learning on Multiclass Identification of Common Weed Species in Cotton Production Systems | Oct 11, 2021 | BenchmarkingManagement | CodeCode Available | 1 |