Shape Robust Text Detection with Progressive Scale Expansion Network

2018-06-07

Xiang Li, Wenhai Wang, Wenbo Hou, Ruo-Ze Liu, Tong Lu, Jian Yang

Code

github.com/whai362/PSENet (Official, In paper, tf, ★ 0)
github.com/JiaquanYe/TableMASTER-mmocr (pytorch, ★ 470)
github.com/DePengW/PSENet (tf, ★ 1)
github.com/PaddleEdu/OCR-models-PaddlePaddle/tree/main/PSENet (paddle, ★ 0)
github.com/SimonWang00/psenet.tf2 (tf, ★ 0)
github.com/li10141110/PSENet-tf2 (tf, ★ 0)
github.com/Mael-zys/PSENet (pytorch, ★ 0)
github.com/liuheng92/tensorflow_PSENet (tf, ★ 0)
github.com/A-ZHANG1/PSENet (tf, ★ 0)

Abstract

Shape-robust text detection poses two challenges: 1) most existing detectors based on quadrangular bounding boxes have difficulty locating text of arbitrary shape, which is hard to enclose perfectly in a rectangle; 2) most pixel-wise segmentation-based detectors may fail to separate text instances that are very close to each other. To address these problems, we propose a novel Progressive Scale Expansion Network (PSENet), designed as a segmentation-based detector with multiple predictions for each text instance. These predictions correspond to different "kernels" produced by shrinking the original text instance to various scales. The final detection is then obtained through our progressive scale expansion algorithm, which gradually expands the kernels from the minimal scale to the complete shape of each text instance. Because there are large geometrical margins among the minimal kernels, our method can effectively distinguish adjacent text instances and is robust to arbitrary shapes. State-of-the-art results on the ICDAR 2015 and ICDAR 2017 MLT benchmarks further confirm the effectiveness of PSENet. Notably, PSENet outperforms the previous best record by an absolute 6.37% on the curved-text dataset SCUT-CTW1500. Code will be available at https://github.com/whai362/PSENet.
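
The expansion step described in the abstract can be illustrated with a minimal sketch. It is not the official implementation: it assumes the network's kernel predictions have already been thresholded into binary masks ordered from smallest to largest, and the function names `label_components` and `progressive_scale_expansion`, the 4-connectivity, and the first-come-first-served handling of contested pixels are choices made for this illustration.

```python
from collections import deque

import numpy as np

NEIGHBORS = ((-1, 0), (1, 0), (0, -1), (0, 1))  # 4-connectivity (assumed)


def label_components(mask):
    """Label 4-connected foreground regions of a binary mask with 1, 2, ..."""
    labels = np.zeros(mask.shape, dtype=np.int32)
    h, w = mask.shape
    next_label = 0
    for y in range(h):
        for x in range(w):
            if mask[y, x] and labels[y, x] == 0:
                next_label += 1
                labels[y, x] = next_label
                queue = deque([(y, x)])
                while queue:
                    cy, cx = queue.popleft()
                    for dy, dx in NEIGHBORS:
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny, nx] and labels[ny, nx] == 0):
                            labels[ny, nx] = next_label
                            queue.append((ny, nx))
    return labels


def progressive_scale_expansion(kernels):
    """Grow instance labels from the smallest kernel out to the largest.

    kernels: binary masks of equal shape, ordered smallest to largest.
    Returns an integer label map (0 = background).
    """
    kernels = [np.asarray(k, dtype=bool) for k in kernels]
    # Each connected region of the smallest kernel seeds one text instance.
    labels = label_components(kernels[0])
    h, w = labels.shape
    for kernel in kernels[1:]:
        # Breadth-first expansion of every labelled pixel into the next kernel.
        queue = deque(zip(*np.nonzero(labels)))
        while queue:
            cy, cx = queue.popleft()
            for dy, dx in NEIGHBORS:
                ny, nx = cy + dy, cx + dx
                # A pixel keeps the first label that reaches it.
                if (0 <= ny < h and 0 <= nx < w
                        and kernel[ny, nx] and labels[ny, nx] == 0):
                    labels[ny, nx] = labels[cy, cx]
                    queue.append((ny, nx))
    return labels


# Two seeds that merge into one region at the largest scale stay separate.
small = np.array([[1, 0, 0, 0, 1]])
large = np.array([[1, 1, 1, 1, 1]])
print(progressive_scale_expansion([small, large]))  # [[1 1 1 2 2]]
```

In the toy example the largest mask is a single connected region, so plain connected-component labelling on it would report one instance; seeding from the smallest kernel keeps the two instances apart, which is the separation property the abstract attributes to the margins between minimal kernels.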

Tasks

Curved Text Detection, Scene Text Detection, Text Detection

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ICDAR 2017 MLT	PSENet-1s	Precision	77.01	—	Unverified
SCUT-CTW1500	PSENet-1s	F-Measure	81.17	—	Unverified
SCUT-CTW1500	PSENet-1s	F-Measure	82.2	—	Unverified