Detecting Oriented Text in Natural Images by Linking Segments

2017-03-19 · CVPR 2017 · Code Available

Baoguang Shi, Xiang Bai, Serge Belongie

Code

github.com/GuoLiuFang/seglink-lfs (tf)
github.com/dengdan/seglink (tf)
github.com/YohannaYin/segmentlink_yh (tf)
github.com/curbmap/curbmap-ml (tf)
github.com/Yiming992/Test-Linking-Segments-Text-Localization (tf)
github.com/bgshih/seglink (tf)

Abstract

Most state-of-the-art text detection methods are specific to horizontal Latin text and are not fast enough for real-time applications. We introduce Segment Linking (SegLink), an oriented text detection method. The main idea is to decompose text into two locally detectable elements, namely segments and links. A segment is an oriented box covering a part of a word or text line; a link connects two adjacent segments, indicating that they belong to the same word or text line. Both elements are detected densely at multiple scales by an end-to-end trained, fully-convolutional neural network. Final detections are produced by combining segments connected by links. Compared with previous methods, SegLink improves along the dimensions of accuracy, speed, and ease of training. It achieves an F-measure of 75.0% on the standard ICDAR 2015 Incidental (Challenge 4) benchmark, outperforming the previous best by a large margin. It runs at over 20 FPS on 512x512 images. Moreover, without modification, SegLink is able to detect long lines of non-Latin text, such as Chinese.
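
The combining step mentioned in the abstract can be pictured as a graph problem: segments are nodes, links are edges, and each connected group of segments becomes one word or text line. The sketch below is only an illustration under that assumption, not the authors' implementation. The function name `group_segments` and the representation of links as index pairs are hypothetical, and the sketch stops at grouping; it does not fit an oriented box to each group.

```python
from collections import defaultdict


def group_segments(num_segments, links):
    """Group segment indices into connected components with union-find.

    num_segments: number of detected segments, indexed 0..num_segments-1.
    links: iterable of (i, j) index pairs, one per detected link
           (hypothetical representation chosen for this sketch).
    Returns a sorted list of groups, each a list of segment indices.
    """
    parent = list(range(num_segments))

    def find(i):
        # Follow parent pointers to the root, shortening the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Merge the components of every pair of linked segments.
    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Collect segments by the root of their component.
    groups = defaultdict(list)
    for i in range(num_segments):
        groups[find(i)].append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    # Five segments: 0-1-2 form one text line, 3-4 form another.
    print(group_segments(5, [(0, 1), (1, 2), (3, 4)]))
```

A segment with no links ends up in a group of its own, so isolated detections are still returned and can be filtered or kept by a later step.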

Tasks

Curved Text Detection, Scene Text Detection, Text Detection

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ICDAR 2013	SegLink	F-Measure	85.3	—	Unverified
ICDAR 2015	WordSup (VGG16-synth-icdar)	F-Measure	78.2	—	Unverified
MSRA-TD500	SegLink	F-Measure	77	—	Unverified
