| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Jun 24, 2024 | parameter-efficient fine-tuningSentence | CodeCode Available | 7 |
| Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | May 30, 2023 | Machine TranslationSegmentation | CodeCode Available | 3 |
| Abstractive Summarization of Spoken andWritten Instructions with BERT | Aug 21, 2020 | Abstractive Text SummarizationArticles | CodeCode Available | 2 |
| Opera Graeca Adnotata: Building a 34M+ Token Multilayer Corpus for Ancient Greek | Mar 31, 2024 | LemmatizationSentence | CodeCode Available | 1 |
| Lexical Semantic Recognition | Apr 30, 2020 | Natural Language UnderstandingSentence | CodeCode Available | 1 |
| Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation | Nov 28, 2023 | Machine TranslationQuestion Answering | CodeCode Available | 1 |
| A unified approach to sentence segmentation of punctuated text in many languages | Aug 1, 2021 | SentenceSentence segmentation | CodeCode Available | 1 |
| Mukayese: Turkish NLP Strikes Back | Mar 2, 2022 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation | Sep 20, 2020 | Machine TranslationSentence | CodeCode Available | 1 |
| Abstractive Summarization of Spoken and Written Instructions with BERT | Aug 21, 2020 | Abstractive Text SummarizationArticles | CodeCode Available | 1 |
| Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing | Jan 9, 2021 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 |
| KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models | Oct 17, 2023 | Fact VerificationKnowledge Graphs | CodeCode Available | 1 |
| 82 Treebanks, 34 Models: Universal Dependency Parsing with Multi-Treebank Models | Sep 6, 2018 | Dependency ParsingPOS | —Unverified | 0 |
| Evaluating Sentence Segmentation in Different Datasets of Neuropsychological Language Tests in Brazilian Portuguese | May 1, 2020 | SentenceSentence segmentation | —Unverified | 0 |
| Evaluation of Simple Distributional Compositional Operations on Longer Texts | May 1, 2014 | Semantic Textual SimilaritySentence | —Unverified | 0 |
| Experiments on transfer learning architectures for biomedical relation extraction | Nov 24, 2020 | Open-Ended Question AnsweringRelation | —Unverified | 0 |
| Feature Fusion Strategies for End-to-End Evaluation of Cognitive Behavior Therapy Sessions | May 15, 2020 | SentenceSentence segmentation | —Unverified | 0 |
| Fine-Grained Control of Sentence Segmentation and Entity Positioning in Neural NLG | Nov 1, 2019 | Data-to-Text GenerationPosition | —Unverified | 0 |
| From Raw Text to Universal Dependencies - Look, No Tags! | Aug 1, 2017 | Dependency ParsingPart-Of-Speech Tagging | —Unverified | 0 |
| GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts | Jul 11, 2023 | Model SelectionPart-Of-Speech Tagging | —Unverified | 0 |
| IBM Research at the CoNLL 2018 Shared Task on Multilingual Parsing | Oct 1, 2018 | ARCDependency Parsing | —Unverified | 0 |
| IMS at the CoNLL 2017 UD Shared Task: CRFs and Perceptrons Meet Neural Networks | Aug 1, 2017 | POSSegmentation | —Unverified | 0 |
| Inforex -- a web-based tool for text corpus management and semantic annotation | May 1, 2012 | ManagementNamed Entity Recognition (NER) | —Unverified | 0 |
| Integration of Automatic Sentence Segmentation and Lexical Analysis of Ancient Chinese based on BiLSTM-CRF Model | May 1, 2020 | Lexical Analysisnamed-entity-recognition | —Unverified | 0 |
| Midas Loop: A Prioritized Human-in-the-Loop Annotation for Large Scale Multilayer Data | Jun 1, 2022 | Active LearningManagement | —Unverified | 0 |
| Mukayese: Turkish NLP Strikes Back | Nov 16, 2021 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Online Sentence Segmentation for Simultaneous Interpretation using Multi-Shifted Recurrent Neural Network | Aug 1, 2019 | SentenceSentence segmentation | —Unverified | 0 |
| Enhancements in statistical spoken language translation by de-normalization of ASR results | Nov 18, 2015 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Basic Language Resource Kit for Persian | May 1, 2012 | Part-Of-Speech TaggingPOS | —Unverified | 0 |
| A Latent Topic Modeling approach for Subject Summarization of Research on the Military Art and Science in South Korea | Jun 1, 2020 | ArticlesGeneral Classification | —Unverified | 0 |
| Alibaba Submission to the WMT20 Parallel Corpus Filtering Task | Nov 1, 2020 | DiversityLanguage Identification | —Unverified | 0 |
| A Statistical, Grammar-Based Approach to Microplanning | Apr 1, 2017 | SentenceSentence segmentation | —Unverified | 0 |
| Better Chinese Sentence Segmentation with Reinforcement Learning | Aug 1, 2021 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Bulgarian X-language Parallel Corpus | May 1, 2012 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Classical Chinese Sentence Segmentation for Tomb Biographies of Tang Dynasty | Aug 28, 2019 | BIG-bench Machine LearningSentence | —Unverified | 0 |
| Corpus Augmentation by Sentence Segmentation for Low-Resource Neural Machine Translation | May 22, 2019 | Low Resource Neural Machine TranslationLow-Resource Neural Machine Translation | —Unverified | 0 |
| Creating Training Corpora for NLG Micro-Planners | Jul 1, 2017 | Data-to-Text GenerationReferring Expression | —Unverified | 0 |
| CUNI Systems in WMT21: Revisiting Backtranslation Techniques for English-Czech NMT | Nov 1, 2021 | NMTSegmentation | —Unverified | 0 |
| Elephant: Sequence Labeling for Word and Sentence Segmentation | Oct 1, 2013 | Boundary DetectionFeature Engineering | —Unverified | 0 |
| Sentiment Analysis for Troll Detection on Weibo | Mar 7, 2021 | SentenceSentence segmentation | —Unverified | 0 |
| TGIF: Tree-Graph Integrated-Format Parser for Enhanced UD with Two-Stage Generic- to Individual-Language Finetuning | Jul 14, 2021 | SentenceSentence segmentation | —Unverified | 0 |
| The AFRL IWSLT 2020 Systems: Work-From-Home Edition | Jul 1, 2020 | Action DetectionActivity Detection | —Unverified | 0 |
| The DeepMind Chinese–English Document Translation System at WMT2020 | Nov 1, 2020 | Document TranslationSentence | —Unverified | 0 |
| The Reading Machine: A Versatile Framework for Studying Incremental Parsing Strategies | Aug 1, 2021 | Dependency ParsingLemmatization | —Unverified | 0 |
| The WebNLG Challenge: Generating Text from RDF Data | Sep 1, 2017 | Referring ExpressionReferring expression generation | —Unverified | 0 |
| Transformer-Encoder-GRU (T-E-GRU) for Chinese Sentiment Analysis on Chinese Comment Text | Aug 1, 2021 | Chinese Sentiment AnalysisPosition | —Unverified | 0 |
| UDPipe 2.0 Prototype at CoNLL 2018 UD Shared Task | Oct 1, 2018 | Dependency ParsingLemmatization | —Unverified | 0 |
| Universal Joint Morph-Syntactic Processing: The Open University of Israel's Submission to The CoNLL 2017 Shared Task | Aug 1, 2017 | MORPHSentence | —Unverified | 0 |
| When Classical Chinese Meets Machine Learning: Explaining the Relative Performances of Word and Sentence Segmentation Tasks | Jul 22, 2020 | BIG-bench Machine LearningSegmentation | —Unverified | 0 |
| Optical Character Recognition, Word Segmentation, Sentence Segmentation, and Information Extraction for Historical and Literature Texts in Classical Chinese | Sep 1, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |