LiteSeg: A Novel Lightweight ConvNet for Semantic Segmentation
Taha Emara, Hossam E. Abd El Munim, Hazem M. Abbas
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/zh320/realtime-semantic-segmentation-pytorchpytorch★ 249
- github.com/tahaemara/LiteSegpytorch★ 0
Abstract
Semantic image segmentation plays a pivotal role in many vision applications including autonomous driving and medical image analysis. Most of the former approaches move towards enhancing the performance in terms of accuracy with a little awareness of computational efficiency. In this paper, we introduce LiteSeg, a lightweight architecture for semantic image segmentation. In this work, we explore a new deeper version of Atrous Spatial Pyramid Pooling module (ASPP) and apply short and long residual connections, and depthwise separable convolution, resulting in a faster and efficient model. LiteSeg architecture is introduced and tested with multiple backbone networks as Darknet19, MobileNet, and ShuffleNet to provide multiple trade-offs between accuracy and computational cost. The proposed model LiteSeg, with MobileNetV2 as a backbone network, achieves an accuracy of 67.81% mean intersection over union at 161 frames per second with 640 360 resolution on the Cityscapes dataset.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Cityscapes test | LightSeg-DarkNet19 | Mean IoU (class) | 70.75 | — | Unverified |
| Cityscapes test | LightSeg-MobileNet | Mean IoU (class) | 67.81 | — | Unverified |
| Cityscapes test | LiteSeg-MobileNet | Mean IoU (class) | 67.81 | — | Unverified |
| Cityscapes test | LightSeg-ShuffleNet | Mean IoU (class) | 65.17 | — | Unverified |
| Cityscapes test | LiteSeg-ShuffleNet | Mean IoU (class) | 65.17 | — | Unverified |