| Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation | Oct 1, 2024 | DescriptiveInductive Bias | —Unverified | 0 |
| Combining resources for MWE-token classification | Jul 1, 2012 | ClassificationGeneral Classification | —Unverified | 0 |
| Comparing Variation in Tokenizer Outputs Using a Series of Problematic and Challenging Biomedical Sentences | May 15, 2023 | Sentencetoken-classification | —Unverified | 0 |
| Comparison Study Between Token Classification and Sequence Classification In Text Classification | Nov 25, 2022 | ClassificationLanguage Modeling | —Unverified | 0 |
| Data Cleaning Tools for Token Classification Tasks | Jun 1, 2021 | Classificationnamed-entity-recognition | —Unverified | 0 |
| De-identification of Unstructured Clinical Texts from Sequence to Sequence Perspective | Aug 18, 2021 | De-identificationnamed-entity-recognition | —Unverified | 0 |
| ECSpell^UD: Zero-shot Domain Adaptive Chinese Spelling Check with User Dictionary | Nov 16, 2021 | Domain Adaptationtoken-classification | —Unverified | 0 |
| Evaluating Input Representation for Language Identification in Hindi-English Code Mixed Text | Nov 23, 2020 | Language IdentificationSentence | —Unverified | 0 |
| Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models | Aug 22, 2024 | In-Context LearningKnowledge Distillation | —Unverified | 0 |
| John_Snow_Labs@SMM4H’22: Social Media Mining for Health (#SMM4H) with Spark NLP | Oct 1, 2022 | ClassificationGPU | —Unverified | 0 |
| Learning the Language of NVMe Streams for Ransomware Detection | Feb 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Three Types of Embeddings from Masked Language Models in Idiom Token Classification | Jul 1, 2022 | Classificationtoken-classification | —Unverified | 0 |
| Looks can be Deceptive: Distinguishing Repetition Disfluency from Reduplication | Jul 11, 2024 | token-classificationToken Classification | —Unverified | 0 |
| MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores | Apr 23, 2025 | Long-Context Understandingtoken-classification | —Unverified | 0 |
| Multimodal Document Analytics for Banking Process Automation | Jul 21, 2023 | token-classificationToken Classification | —Unverified | 0 |
| MultiVitaminBooster at PARSEME Shared Task 2020: Combining Window- and Dependency-Based Features with Multilingual Contextualised Word Embeddings for VMWE Detection | Dec 1, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NBIAS: A Natural Language Processing Framework for Bias Identification in Text | Aug 3, 2023 | token-classificationToken Classification | —Unverified | 0 |
| Nested Named Entity Recognition as Single-Pass Sequence Labeling | May 22, 2025 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts | May 26, 2023 | Information Retrievalnamed-entity-recognition | —Unverified | 0 |
| Persian Typographical Error Type Detection Using Deep Neural Networks on Algorithmically-Generated Misspellings | May 19, 2023 | Spelling Correctiontoken-classification | —Unverified | 0 |
| Preserving Empirical Probabilities in BERT for Small-sample Clinical Entity Recognition | Sep 5, 2024 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Instruction Fine-Tuning: Does Prompt Loss Matter? | Jan 24, 2024 | Multiple-choicetoken-classification | —Unverified | 0 |
| Region-dependent temperature scaling for certainty calibration and application to class-imbalanced token classification | May 1, 2022 | NERtoken-classification | —Unverified | 0 |
| Revisiting Supertagging for Faster HPSG Pasing | Sep 14, 2023 | token-classificationToken Classification | —Unverified | 0 |
| Robust and Fine-Grained Detection of AI Generated Texts | Apr 16, 2025 | token-classificationToken Classification | —Unverified | 0 |