Text-Aware Image Restoration with Diffusion Models Jun 11, 2025 Denoising Hallucination
— Unverified 0GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking May 28, 2025 Benchmarking Text Spotting
Code Code Available 1SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting Apr 14, 2025 Domain Adaptation Text Detection
Code Code Available 1TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification Mar 9, 2025 Robot Navigation STS
Code Code Available 1OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models Feb 22, 2025 document understanding Key Information Extraction
Code Code Available 0CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR Jan 1, 2025 All Optical Character Recognition
— Unverified 0Hear the Scene: Audio-Enhanced Text Spotting Dec 27, 2024 Text Spotting
— Unverified 0InstructOCR: Instruction Boosting Scene Text Spotting Dec 20, 2024 Optical Character Recognition (OCR) Text Spotting
Code Code Available 0Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance Dec 13, 2024 Scene Text Recognition Text Spotting
— Unverified 0HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction Nov 2, 2024 Image Reconstruction Optical Character Recognition (OCR)
— Unverified 0FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting Aug 27, 2024 Benchmarking Decoder
Code Code Available 0DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training Aug 1, 2024 Denoising Graph Matching
Code Code Available 1WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting Jul 28, 2024 Contrastive Learning Text Spotting
Code Code Available 0CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction Jul 23, 2024 Image Inpainting Image Restoration
— Unverified 0Block-level Text Spotting with LLMs Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model May 29, 2024 Position Text Spotting
— Unverified 0VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization Apr 30, 2024 Domain Adaptation Domain Generalization
Code Code Available 2Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer Apr 19, 2024 Decoder Optical Character Recognition
— Unverified 0Bridging the Gap Between End-to-End and Two-Step Text Spotting Apr 6, 2024 Text Spotting
Code Code Available 2Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments Apr 1, 2024 Ensemble Learning Text Detection
— Unverified 0OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition Mar 28, 2024 Decoder document understanding
Code Code Available 0TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model Mar 15, 2024 Language Modeling Language Modelling
— Unverified 0TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document Mar 7, 2024 document understanding Key Information Extraction
Code Code Available 5Efficiently Leveraging Linguistic Priors for Scene Text Spotting Feb 27, 2024 Scene Text Recognition Text Detection
— Unverified 0Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing Feb 12, 2024 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting Jan 15, 2024 Text Detection Text Spotting
Code Code Available 1GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching Jan 13, 2024 Text Detection Text Spotting
Code Code Available 1Watermark Text Pattern Spotting in Document Images Jan 10, 2024 Text Spotting
— Unverified 0GloTSFormer: Global Video Text Spotting Transformer Jan 8, 2024 Text Spotting
Code Code Available 0Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling Jan 8, 2024 Text Detection Text Spotting
— Unverified 0OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition Jan 1, 2024 Decoder document understanding
Code Code Available 0Word length-aware text spotting: Enhancing detection and recognition in dense text image Dec 25, 2023 Text Detection Text Spotting
— Unverified 0Parrot Captions Teach CLIP to Spot Text Dec 21, 2023 Representation Learning text similarity
Code Code Available 1Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis Oct 25, 2023 Text Spotting
Code Code Available 2Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance Oct 2, 2023 Scene Text Detection Text Detection
Code Code Available 0Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes Oct 1, 2023 Super-Resolution Text Spotting
— Unverified 0STEP -- Towards Structured Scene-Text Spotting Sep 5, 2023 Optical Character Recognition (OCR) Scene Text Detection
Code Code Available 0Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration Sep 3, 2023 Decoder document understanding
— Unverified 0Deformation Robust Text Spotting with Geometric Prior Aug 31, 2023 Diversity Text Detection
— Unverified 0ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer Aug 20, 2023 Decoder Text Detection
Code Code Available 1TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision Jun 6, 2023 Decoder Scene Text Detection
— Unverified 0DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting May 31, 2023 Decoder Scene Text Detection
Code Code Available 2FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation May 5, 2023 Optical Flow Estimation Text Spotting
Code Code Available 1Scalable Mask Annotation for Video Text Spotting May 2, 2023 Text Spotting
Code Code Available 1ICDAR 2023 Video Text Reading Competition for Dense and Small Text Apr 10, 2023 Task 2 Text Detection
— Unverified 0Towards Unified Scene Text Spotting based on Sequence Generation Apr 7, 2023 Text Spotting
Code Code Available 1VGTS: Visually Guided Text Spotting for Novel Categories in Historical Manuscripts Apr 3, 2023 Geometric Matching Metric Learning
— Unverified 0Video text tracking for dense and small text based on pp-yoloe-r and sort algorithm Mar 31, 2023 object-detection Object Detection
— Unverified 0Modeling Entities as Semantic Points for Visual Information Extraction in the Wild Mar 23, 2023 Text Spotting
— Unverified 0A3S: Adversarial learning of semantic representations for Scene-Text Spotting Feb 21, 2023 Text Spotting
— Unverified 0