| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Jun 24, 2024 | parameter-efficient fine-tuning, Sentence | Code Available | 7 |
| Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | May 30, 2023 | Machine Translation, Segmentation | Code Available | 3 |
| Abstractive Summarization of Spoken and Written Instructions with BERT | Aug 21, 2020 | Abstractive Text Summarization, Articles | Code Available | 2 |
| Opera Graeca Adnotata: Building a 34M+ Token Multilayer Corpus for Ancient Greek | Mar 31, 2024 | Lemmatization, Sentence | Code Available | 1 |
| Ascle: A Python Natural Language Processing Toolkit for Medical Text Generation | Nov 28, 2023 | Machine Translation, Question Answering | Code Available | 1 |
| KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models | Oct 17, 2023 | Fact Verification, Knowledge Graphs | Code Available | 1 |
| Mukayese: Turkish NLP Strikes Back | Mar 2, 2022 | Benchmarking, Language Modeling | Code Available | 1 |
| A unified approach to sentence segmentation of punctuated text in many languages | Aug 1, 2021 | Sentence, Sentence segmentation | Code Available | 1 |
| Trankit: A Light-Weight Transformer-based Toolkit for Multilingual Natural Language Processing | Jan 9, 2021 | Dependency Parsing, Language Modeling | Code Available | 1 |
| Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation | Sep 20, 2020 | Machine Translation, Sentence | Code Available | 1 |