SOTAVerified

Hierarchical Multi-Label Classification with Missing Information for Benthic Habitat Imagery

2024-09-10Code Available0· sign in to hype

Isaac Xu, Benjamin Misiuk, Scott C. Lowe, Martin Gillis, Craig J. Brown, Thomas Trappenberg

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

In this work, we apply state-of-the-art self-supervised learning techniques on a large dataset of seafloor imagery, BenthicNet, and study their performance for a complex hierarchical multi-label (HML) classification downstream task. In particular, we demonstrate the capacity to conduct HML training in scenarios where there exist multiple levels of missing annotation information, an important scenario for handling heterogeneous real-world data collected by multiple research groups with differing data collection protocols. We find that, when using smaller one-hot image label datasets typical of local or regional scale benthic science projects, models pre-trained with self-supervision on a larger collection of in-domain benthic data outperform models pre-trained on ImageNet. In the HML setting, we find the model can attain a deeper and more precise classification if it is pre-trained with self-supervision on in-domain data. We hope this work can establish a benchmark for future models in the field of automated underwater image annotation tasks and can guide work in other domains with hierarchical annotations of mixed resolution.

Tasks

Reproductions