E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles

2022-06-05

Zhenyu Hu, Zhenyu Wu, Pengcheng Pi, Yunhe Xue, Jiayi Shen, Jianchao Tan, Xiangru Lian, Zhangyang Wang, Ji Liu

Code

github.com/wuzhenyusjtu/lpcvc20-videotextspotting
Official · In paper · PyTorch

Abstract

Video text spotting based on Unmanned Aerial Vehicles (UAVs) has been extensively used in civil and military domains. The limited battery capacity of UAVs motivates us to develop an energy-efficient video text spotting solution. In this paper, we first revisit RCNN's crop & resize training strategy and empirically find that it outperforms aligned RoI sampling on a real-world video text dataset captured by a UAV. To reduce energy consumption, we further propose a multi-stage image processor that takes videos' redundancy, continuity, and mixed degradation into account. Lastly, the model is pruned and quantized before being deployed on a Raspberry Pi. Our proposed energy-efficient video text spotting solution, dubbed E^2VTS, outperforms all previous methods by achieving a competitive tradeoff between energy efficiency and performance. All our code and pre-trained models are available at https://github.com/wuzhenyusjtu/LPCVC20-VideoTextSpotting.
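
The crop & resize strategy named in the abstract goes back to the original R-CNN idea of cutting each candidate region out of the image and warping it to a fixed size before it is passed on. The following is a minimal sketch of that idea only, assuming a grayscale image stored as nested lists and nearest-neighbor sampling. The function name `crop_and_resize`, the `(x0, y0, x1, y1)` box format, and the toy frame are illustrative assumptions and are not taken from the authors' repository.

```python
from typing import List, Tuple

Image = List[List[float]]  # grayscale image as rows of pixel values


def crop_and_resize(image: Image, box: Tuple[int, int, int, int],
                    out_h: int, out_w: int) -> Image:
    """Crop box = (x0, y0, x1, y1), with x1 and y1 exclusive, and
    resize the crop to out_h x out_w using nearest-neighbor sampling.

    Illustrative helper (hypothetical, not the paper's implementation).
    """
    x0, y0, x1, y1 = box
    crop_h, crop_w = y1 - y0, x1 - x0
    if crop_h <= 0 or crop_w <= 0:
        raise ValueError("box must have positive width and height")

    resized = []
    for i in range(out_h):
        # Map output row i to a source row inside the crop.
        src_y = y0 + min(crop_h - 1, (i * crop_h) // out_h)
        row = []
        for j in range(out_w):
            # Map output column j to a source column inside the crop.
            src_x = x0 + min(crop_w - 1, (j * crop_w) // out_w)
            row.append(image[src_y][src_x])
        resized.append(row)
    return resized


# Toy 4x6 "frame" whose pixel value encodes its position (10 * row + col).
frame = [[10 * r + c for c in range(6)] for r in range(4)]

# Crop the 2x2 region covering rows 1-2 and columns 2-3, then warp it to 4x4.
patch = crop_and_resize(frame, box=(2, 1, 4, 3), out_h=4, out_w=4)
for row in patch:
    print(row)
```

Every region ends up with the same output size regardless of its original shape, which is what lets a downstream recognizer consume fixed-size inputs. Aligned RoI sampling, by contrast, samples from a feature map at fractional coordinates rather than from image pixels; that variant is not shown here.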

Tasks

Text Spotting
