| Document Summarization with Text Segmentation | Jan 20, 2023 | ArticlesDocument Summarization | —Unverified | 0 |
| Structured Summarization: Unified Text Segmentation and Segment Labeling as a Generation Task | Sep 28, 2022 | DecoderSegmentation | —Unverified | 0 |
| OCR for TIFF Compressed Document Images Directly in Compressed Domain Using Text segmentation and Hidden Markov Model | Sep 13, 2022 | Optical Character Recognition (OCR)Text Segmentation | —Unverified | 0 |
| DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon | Jun 22, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unsupervised Tokenization Learning | May 23, 2022 | Text Segmentation | —Unverified | 0 |
| Towards Deployable OCR models for Indic languages | May 13, 2022 | Optical Character Recognition (OCR)Segmentation | —Unverified | 0 |
| TopWORDS-Seg: Simultaneous Text Segmentation and Word Discovery for Open-Domain Chinese Texts via Bayesian Inference | May 1, 2022 | Bayesian InferenceSegmentation | —Unverified | 0 |
| Fuzzy Segmentations of a String | Jan 31, 2022 | ClusteringSegmentation | —Unverified | 0 |
| BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild | Jan 1, 2022 | SegmentationStyle Transfer | —Unverified | 0 |
| Weakly supervised discourse segmentation for multiparty oral conversations | Nov 1, 2021 | Discourse SegmentationSegmentation | CodeCode Available | 0 |