SOTAVerified

Chinese Word Segmentation

Chinese word segmentation is the task of splitting Chinese text (i.e. a sequence of Chinese characters) into words (Source: www.nlpprogress.com).

Papers

Showing 150 of 244 papers

TitleStatusHype
N-LTP: An Open-source Neural Language Technology Platform for ChineseCode3
ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram RepresentationsCode1
LSICC: A Large Scale Informal Chinese CorpusCode1
Sub-Character Tokenization for Chinese Pretrained Language ModelsCode1
Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-way Attentions of Auto-analyzed KnowledgeCode1
RethinkCWS: Is Chinese Word Segmentation a Solved Task?Code1
Joint Chinese Word Segmentation and Part-of-speech Tagging via Multi-channel Attention of Character N-gramsCode1
fastHan: A BERT-based Multi-Task Toolkit for Chinese NLPCode1
Semi-supervised URL Segmentation with Recurrent Neural NetworksPre-trained on Knowledge Graph EntitiesCode1
The Uncertainty-based Retrieval Framework for Ancient Chinese CWS and POSCode1
Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word SegmentationCode1
Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph EntitiesCode1
Fast and Accurate End-to-End Span-based Semantic Role Labeling as Word-based Graph ParsingCode1
Improving Chinese Word Segmentation with Wordhood Memory NetworksCode1
Unsupervised Neural Word Segmentation for Chinese via Segmental Language ModelingCode0
Transition-Based Neural Word SegmentationCode0
Weighted self Distillation for Chinese word segmentationCode0
Simplifying Neural Machine Translation with Addition-Subtraction Twin-Gated Recurrent NetworksCode0
Adversarial Transfer Learning for Chinese Named Entity Recognition with Self-Attention MechanismCode0
State-of-the-art Chinese Word Segmentation with Bi-LSTMsCode0
Word Segmentation by Separation Inference for East Asian LanguagesCode0
A Concise Model for Multi-Criteria Chinese Word Segmentation with Transformer EncoderCode0
PKUSEG: A Toolkit for Multi-Domain Chinese Word SegmentationCode0
Segmental Recurrent Neural NetworksCode0
Ancient Chinese Word Segmentation and Part-of-Speech Tagging Using Distant SupervisionCode0
LATTE: Lattice ATTentive Encoding for Character-based Word SegmentationCode0
Subword Encoding in Lattice LSTM for Chinese Word SegmentationCode0
Neural Chinese Word Segmentation as Sequence to Sequence TranslationCode0
Unsupervised Boundary-Aware Language Model Pretraining for Chinese Sequence LabelingCode0
Unsupervised Chinese Word Segmentation with BERT Oriented Probing and TransformationCode0
Attention Is All You Need for Chinese Word SegmentationCode0
Improving Cross-Domain Chinese Word Segmentation with Word EmbeddingsCode0
Investigating Self-Attention Network for Chinese Word SegmentationCode0
Neural Word Segmentation Learning for ChineseCode0
A Segmentation Matrix Method for Chinese Segmentation Ambiguity AnalysisCode0
Exploring Word Segmentation and Medical Concept Recognition for Chinese Medical TextsCode0
Adaptive Multi-Task Transfer Learning for Chinese Word Segmentation in Medical TextCode0
Fast and Accurate Neural Word Segmentation for ChineseCode0
Glyce: Glyph-vectors for Chinese Character RepresentationsCode0
Approaching Neural Chinese Word Segmentation as a Low-Resource Machine Translation TaskCode0
Dual Long Short-Term Memory Networks for Sub-Character Representation LearningCode0
A Graph-based Model for Joint Chinese Word Segmentation and Dependency ParsingCode0
Effective Neural Solution for Multi-Criteria Word SegmentationCode0
Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language ModelCode0
Mining Word Boundaries from Speech-Text Parallel Data for Cross-domain Chinese Word SegmentationCode0
More than Text: Multi-modal Chinese Word SegmentationCode0
Convolutional Neural Network with Word Embeddings for Chinese Word SegmentationCode0
ChiMST: A Chinese Medical Corpus for Word Segmentation and Medical Term RecognitionCode0
Exploring Segment Representations for Neural Segmentation ModelsCode0
GNN-SL: Sequence Labeling Based on Nearest Examples via GNNCode0
Show:102550
← PrevPage 1 of 5Next →

No leaderboard results yet.