| Human Genome Book: Words, Sentences and Paragraphs | Jan 23, 2025 | Protein Structure PredictionSentence segmentation | CodeCode Available | 0 |
| Segmentation en phrases : ouvrez les guillemets sans perdre le fil | Jul 29, 2024 | SentenceSentence segmentation | —Unverified | 0 |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Jun 24, 2024 | parameter-efficient fine-tuningSentence | CodeCode Available | 7 |
| Opera Graeca Adnotata: Building a 34M+ Token Multilayer Corpus for Ancient Greek | Mar 31, 2024 | LemmatizationSentence | CodeCode Available | 1 |
| Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation | Nov 28, 2023 | Machine TranslationQuestion Answering | CodeCode Available | 1 |
| KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models | Oct 17, 2023 | Fact VerificationKnowledge Graphs | CodeCode Available | 1 |
| GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts | Jul 11, 2023 | Model SelectionPart-Of-Speech Tagging | —Unverified | 0 |
| Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | May 30, 2023 | Machine TranslationSegmentation | CodeCode Available | 3 |
| Prosodic features improve sentence segmentation and parsing | Feb 23, 2023 | SentenceSentence segmentation | CodeCode Available | 0 |
| Sentence Identification with BOS and EOS Label Combinations | Jan 31, 2023 | SentenceSentence segmentation | —Unverified | 0 |
| SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content | Nov 8, 2022 | FormSegmentation | CodeCode Available | 0 |
| Midas Loop: A Prioritized Human-in-the-Loop Annotation for Large Scale Multilayer Data | Jun 1, 2022 | Active LearningManagement | —Unverified | 0 |
| LeConTra: A Learner Corpus of English-to-Dutch News Translation | Jun 1, 2022 | SentenceSentence segmentation | CodeCode Available | 0 |
| Mukayese: Turkish NLP Strikes Back | Mar 2, 2022 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |
| Mukayese: Turkish NLP Strikes Back | Nov 16, 2021 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| CUNI Systems in WMT21: Revisiting Backtranslation Techniques for English-Czech NMT | Nov 1, 2021 | NMTSegmentation | —Unverified | 0 |
| Better Chinese Sentence Segmentation with Reinforcement Learning | Aug 1, 2021 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| The Reading Machine: A Versatile Framework for Studying Incremental Parsing Strategies | Aug 1, 2021 | Dependency ParsingLemmatization | —Unverified | 0 |
| A unified approach to sentence segmentation of punctuated text in many languages | Aug 1, 2021 | SentenceSentence segmentation | CodeCode Available | 1 |
| Transformer-Encoder-GRU (T-E-GRU) for Chinese Sentiment Analysis on Chinese Comment Text | Aug 1, 2021 | Chinese Sentiment AnalysisPosition | —Unverified | 0 |
| TGIF: Tree-Graph Integrated-Format Parser for Enhanced UD with Two-Stage Generic- to Individual-Language Finetuning | Jul 14, 2021 | SentenceSentence segmentation | —Unverified | 0 |
| Sentiment Analysis for Troll Detection on Weibo | Mar 7, 2021 | SentenceSentence segmentation | —Unverified | 0 |
| Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data | Feb 22, 2021 | SentenceSentence segmentation | CodeCode Available | 0 |
| Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing | Jan 9, 2021 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 |
| Experiments on transfer learning architectures for biomedical relation extraction | Nov 24, 2020 | Open-Ended Question AnsweringRelation | —Unverified | 0 |