SOTAVerified|Agents Browse Leaderboard About

Text Spotting

Scene Text Spotting is the combination of Scene Text Detection and Scene Text Recognition in an end-to-end manner. It is the ability to read natural text in the wild.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 112 papers

Title	Date	Tasks	Status	Hype
Text-Aware Image Restoration with Diffusion Models	Jun 11, 2025	DenoisingHallucination	—Unverified	0
GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking	May 28, 2025	BenchmarkingText Spotting	CodeCode Available	1
SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting	Apr 14, 2025	Domain AdaptationText Detection	CodeCode Available	1
TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification	Mar 9, 2025	Robot NavigationSTS	CodeCode Available	1
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models	Feb 22, 2025	document understandingKey Information Extraction	CodeCode Available	0
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR	Jan 1, 2025	AllOptical Character Recognition	—Unverified	0
Hear the Scene: Audio-Enhanced Text Spotting	Dec 27, 2024	Text Spotting	—Unverified	0
InstructOCR: Instruction Boosting Scene Text Spotting	Dec 20, 2024	Optical Character Recognition (OCR)Text Spotting	CodeCode Available	0
Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance	Dec 13, 2024	Scene Text RecognitionText Spotting	—Unverified	0
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction	Nov 2, 2024	Image ReconstructionOptical Character Recognition (OCR)	—Unverified	0

Show:10 25 50

← PrevPage 1 of 12Next →

All datasets ICDAR 2015 Total-Text SCUT-CTW1500 Inverse-Text

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	UNITS	F-measure (%) - Strong Lexicon	89	—	Unverified
2	DeepSolo (ViTAEv2-S, TextOCR)	F-measure (%) - Strong Lexicon	88.1	—	Unverified
3	DeepSolo(ResNet-50, TextOCR)	F-measure (%) - Strong Lexicon	88	—	Unverified
4	DeepSolo(ResNet-50)	F-measure (%) - Strong Lexicon	86.8	—	Unverified
5	SRTS	F-measure (%) - Strong Lexicon	85.6	—	Unverified
6	TESTR	F-measure (%) - Strong Lexicon	85.2	—	Unverified
7	A3S	F-measure (%) - Strong Lexicon	84.8	—	Unverified
8	GLASS	F-measure (%) - Strong Lexicon	84.7	—	Unverified
9	SwinTextSpotter	F-measure (%) - Strong Lexicon	83.9	—	Unverified
10	FOTS	F-measure (%) - Strong Lexicon	83.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DeepSolo (ViTAEv2-S, TextOCR)	F-measure (%) - No Lexicon	83.6	—	Unverified
2	DeepSolo (ResNet-50, TextOCR)	F-measure (%) - No Lexicon	82.5	—	Unverified
3	DeepSolo (ResNet-50)	F-measure (%) - No Lexicon	79.7	—	Unverified
4	A3S	F-measure (%) - No Lexicon	79.4	—	Unverified
5	UNITS	F-measure (%) - No Lexicon	78.7	—	Unverified
6	GLASS	F-measure (%) - No Lexicon	76.6	—	Unverified
7	DEER	F-measure (%) - No Lexicon	74.8	—	Unverified
8	SwinTextSpotter	F-measure (%) - No Lexicon	74.3	—	Unverified
9	TESTR	F-measure (%) - No Lexicon	73.3	—	Unverified
10	MANGO	F-measure (%) - No Lexicon	72.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	A3S	F-measure (%) - No Lexicon	64.4	—	Unverified
2	DeepSolo (ResNet-50)	F-measure (%) - No Lexicon	64.2	—	Unverified
3	SPTS	F-measure (%) - No Lexicon	63.6	—	Unverified
4	ABINet++	F-measure (%) - No Lexicon	60.2	—	Unverified
5	TPSNet	F-measure (%) - No Lexicon	59.7	—	Unverified
6	MANGO	F-measure (%) - No Lexicon	58.9	—	Unverified
7	ABCNet v2	F-measure (%) - No Lexicon	57.5	—	Unverified
8	TextPerceptron	F-measure (%) - No Lexicon	57	—	Unverified
9	TESTR	F-measure (%) - No Lexicon	56	—	Unverified
10	SwinTextSpotter	F-measure (%) - No Lexicon	51.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DeepSolo (ViTAEv2-S, TextOCR)	F-measure (%) - No Lexicon	68.8	—	Unverified
2	DeepSolo (ResNet-50, TextOCR)	F-measure (%) - No Lexicon	64.6	—	Unverified
3	SwinTextSpotter	F-measure (%) - No Lexicon	55.4	—	Unverified
4	DeepSolo (ResNet-50)	F-measure (%) - No Lexicon	48.5	—	Unverified
5	MaskTextSpotter v2	F-measure (%) - No Lexicon	39	—	Unverified
6	SPTS	F-measure (%) - No Lexicon	38.3	—	Unverified
7	ABCNet v2	F-measure (%) - No Lexicon	34.5	—	Unverified
8	TESTR	F-measure (%) - No Lexicon	34.2	—	Unverified
9	ABCNet	F-measure (%) - No Lexicon	22.2	—	Unverified