Multi-Class Abnormality Classification in Video Capsule Endoscopy Using Deep Learning

2024-10-24Code Available0· sign in to hype

Arnav Samal, Ranya Batsyas

Code Available — Be the first to reproduce this paper.

Code

github.com/arnavs04/capsule-vision-2024
OfficialIn paperpytorch★ 5

Abstract

This report outlines Team Seq2Cure's deep learning approach for the Capsule Vision 2024 Challenge, leveraging an ensemble of convolutional neural networks (CNNs) and transformer-based architectures for multi-class abnormality classification in video capsule endoscopy frames. The dataset comprised over 50,000 frames from three public sources and one private dataset, labeled across 10 abnormality classes. To overcome the limitations of traditional CNNs in capturing global context, we integrated CNN and transformer models within a multi-model ensemble. Our approach achieved a balanced accuracy of 86.34 percent and a mean AUC-ROC score of 0.9908 on the validation set, earning our submission 5th place in the challenge. Code is available at http://github.com/arnavs04/capsule-vision-2024 .

Tasks

Multi-class Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Training and validation dataset of capsule vision 2024 challenge.	Multi-Model Ensemble	Mean AUC	0.99	—	Unverified

Multi-Class Abnormality Classification in Video Capsule Endoscopy Using Deep Learning

Code

Abstract

Tasks

Benchmark Results

Reproductions