| 3DOS: Towards 3D Open Set Learning -- Benchmarking and Understanding Semantic Novelty Detection on Point Clouds | Jul 23, 2022 | Benchmarking, Novelty Detection | Code Available | 0 |
| Panoptic Scene Graph Generation | Jul 22, 2022 | Benchmarking, Panoptic Scene Graph Generation | Code Available | 2 |
| Rethinking the Reference-based Distinctive Image Captioning | Jul 22, 2022 | Attribute, Benchmarking | Code Available | 0 |
| PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation | Jul 22, 2022 | Benchmarking, Domain Adaptation | Unverified | 0 |
| Physiology-based simulation of the retinal vasculature enables annotation-free segmentation of OCT angiographs | Jul 22, 2022 | Benchmarking, Retinal Vessel Segmentation | Code Available | 1 |
| Benchmarking tools for a priori identifiability analysis | Jul 20, 2022 | Benchmarking | Code Available | 0 |
| Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications | Jul 20, 2022 | Benchmarking | Code Available | 0 |
| Detecting beats in the photoplethysmogram: benchmarking open-source algorithms | Jul 19, 2022 | Benchmarking, Photoplethysmography (PPG) beat detection | Code Available | 1 |
| ALTO: A Large-Scale Dataset for UAV Visual Place Recognition and Localization | Jul 19, 2022 | Benchmarking, Image Registration | Code Available | 1 |
| Initial recommendations for performing, benchmarking, and reporting single-cell proteomics experiments | Jul 19, 2022 | Benchmarking, Experimental Design | Code Available | 1 |
| Benchmarking Transformers-based models on French Spoken Language Understanding tasks | Jul 19, 2022 | Benchmarking, Spoken Language Understanding | Unverified | 0 |
| Benchmarking Machine Learning Robustness in Covid-19 Genome Sequence Classification | Jul 18, 2022 | Benchmarking, BIG-bench Machine Learning | Code Available | 0 |
| The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks | Jul 18, 2022 | Benchmarking | Code Available | 0 |
| Why do tree-based models still outperform deep learning on tabular data? | Jul 18, 2022 | Benchmarking | Code Available | 2 |
| GOAL: Towards Benchmarking Few-Shot Sports Game Summarization | Jul 18, 2022 | Benchmarking | Code Available | 0 |
| Benchmarking Omni-Vision Representation through the Lens of Visual Realms | Jul 14, 2022 | Benchmarking, Contrastive Learning | Code Available | 1 |
| Bias Mitigation for Machine Learning Classifiers: A Comprehensive Survey | Jul 14, 2022 | Benchmarking, BIG-bench Machine Learning | Unverified | 0 |
| Immunofluorescence Capillary Imaging Segmentation: Cases Study | Jul 14, 2022 | Benchmarking, Image Segmentation | Code Available | 0 |
| Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification | Jul 13, 2022 | Benchmarking, Label Error Detection | Code Available | 0 |
| Slot Filling for Extracting Reskilling and Upskilling Options from the Web | Jul 11, 2022 | Benchmarking, Entity Linking | Code Available | 0 |
| TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs | Jul 11, 2022 | Benchmarking, Representation Learning | Code Available | 1 |
| Graph Generative Model for Benchmarking Graph Neural Networks | Jul 10, 2022 | Benchmarking, Graph Generation | Code Available | 1 |
| A novel evaluation methodology for supervised Feature Ranking algorithms | Jul 9, 2022 | Benchmarking, Feature Importance | Code Available | 0 |
| Ensemble random forest filter: An alternative to the ensemble Kalman filter for inverse modeling | Jul 8, 2022 | Benchmarking | Unverified | 0 |
| OVQA: A Clinically Generated Visual Question Answering Dataset | Jul 7, 2022 | Benchmarking, Medical Visual Question Answering | Unverified | 0 |
| VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning | Jul 7, 2022 | Benchmarking, Multi-agent Reinforcement Learning | Code Available | 2 |
| Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems | Jul 7, 2022 | Benchmarking | Unverified | 0 |
| Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding | Jul 4, 2022 | Benchmarking, Document Ranking | Code Available | 2 |
| Identifying the Context Shift between Test Benchmarks and Production Data | Jul 3, 2022 | Benchmarking, BIG-bench Machine Learning | Unverified | 0 |
| Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk | Jul 2, 2022 | Benchmarking, Machine Translation | Code Available | 1 |
| Less Is More: A Comparison of Active Learning Strategies for 3D Medical Image Segmentation | Jul 2, 2022 | Active Learning, Benchmarking | Code Available | 1 |
| HATE-ITA: New Baselines for Hate Speech Detection in Italian | Jul 1, 2022 | Benchmarking, Hate Speech Detection | Code Available | 0 |
| SentSpace: Large-Scale Benchmarking and Evaluation of Text using Cognitively Motivated Lexical, Syntactic, and Semantic Features | Jul 1, 2022 | Benchmarking, Sentence | Unverified | 0 |
| Towards Toxic Positivity Detection | Jul 1, 2022 | Benchmarking, Classification | Unverified | 0 |
| Benchmarking Intersectional Biases in NLP | Jul 1, 2022 | Benchmarking, BIG-bench Machine Learning | Code Available | 0 |
| Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding | Jul 1, 2022 | Benchmarking | Unverified | 0 |
| DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles | Jul 1, 2022 | Abstractive Text Summarization, Articles | Unverified | 0 |
| Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking | Jul 1, 2022 | Benchmarking, Natural Language Understanding | Unverified | 0 |
| Benchmarking Language-agnostic Intent Classification for Virtual Assistant Platforms | Jul 1, 2022 | Benchmarking, Classification | Code Available | 0 |
| Local manifold learning and its link to domain-based physics knowledge | Jul 1, 2022 | Benchmarking, Dimensionality Reduction | Code Available | 0 |
| Analyzing the behaviour of D'WAVE quantum annealer: fine-tuning parameterization and tests with restrictive Hamiltonian formulations | Jul 1, 2022 | Benchmarking, Combinatorial Optimization | Unverified | 0 |
| DFGC 2022: The Second DeepFake Game Competition | Jun 30, 2022 | Benchmarking, Face Swapping | Code Available | 1 |
| Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital Pathology | Jun 30, 2022 | Benchmarking, Diagnostic | Code Available | 1 |
| Computer-aided diagnosis and prediction in brain disorders | Jun 29, 2022 | Benchmarking, Decision Making | Unverified | 0 |
| An extensible Benchmarking Graph-Mesh dataset for studying Steady-State Incompressible Navier-Stokes Equations | Jun 29, 2022 | Benchmarking | Code Available | 0 |
| Beyond neural scaling laws: beating power law scaling via data pruning | Jun 29, 2022 | Benchmarking | Code Available | 1 |
| Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames | Jun 29, 2022 | Benchmarking, Diversity | Code Available | 1 |
| Toward an ImageNet Library of Functions for Global Optimization Benchmarking | Jun 27, 2022 | Benchmarking, global-optimization | Unverified | 0 |
| Benchopt: Reproducible, efficient and collaborative optimization benchmarks | Jun 27, 2022 | Benchmarking, image-classification | Code Available | 4 |
| The DEBS 2022 Grand Challenge: Detecting Trading Trends in Financial Tick Data | Jun 23, 2022 | Benchmarking | Code Available | 1 |