| Nested Named Entity Recognition as Single-Pass Sequence Labeling | May 22, 2025 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks | May 15, 2025 | Cross-Lingual Transfertoken-classification | CodeCode Available | 0 |
| MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores | Apr 23, 2025 | Long-Context Understandingtoken-classification | —Unverified | 0 |
| Robust and Fine-Grained Detection of AI Generated Texts | Apr 16, 2025 | token-classificationToken Classification | —Unverified | 0 |
| Improving Applicability of Deep Learning based Token Classification models during Training | Mar 28, 2025 | document understandingtoken-classification | —Unverified | 0 |
| Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation | Feb 27, 2025 | Image Generationtoken-classification | CodeCode Available | 3 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8kGPU | CodeCode Available | 4 |
| Learning the Language of NVMe Streams for Ransomware Detection | Feb 7, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GliLem: Leveraging GliNER for Contextualized Lemmatization in Estonian | Dec 29, 2024 | Information RetrievalLEMMA | —Unverified | 0 |
| POS-tagging to highlight the skeletal structure of sentences | Nov 21, 2024 | Machine TranslationMorphological Analysis | CodeCode Available | 0 |
| Bangla Grammatical Error Detection Leveraging Transformer-based Token Classification | Nov 13, 2024 | Grammatical Error Detectiontoken-classification | —Unverified | 0 |
| AutoTrain: No-code training for state-of-the-art models | Oct 21, 2024 | Classificationimage-classification | CodeCode Available | 7 |
| ChuLo: Chunk-Level Key Information Representation for Long Document Processing | Oct 14, 2024 | ChunkingClassification | CodeCode Available | 0 |
| BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation | Oct 13, 2024 | Natural Language Understandingparameter-efficient fine-tuning | —Unverified | 0 |
| GUS-Net: Social Bias Classification in Text with Generalizations, Unfairness, and Stereotypes | Oct 10, 2024 | Bias Detectiontoken-classification | CodeCode Available | 0 |
| Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation | Oct 1, 2024 | DescriptiveInductive Bias | —Unverified | 0 |
| TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning | Sep 19, 2024 | Code SummarizationComputational Efficiency | —Unverified | 0 |
| Preserving Empirical Probabilities in BERT for Small-sample Clinical Entity Recognition | Sep 5, 2024 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts | Aug 31, 2024 | document understandingtoken-classification | —Unverified | 0 |
| Event Extraction for Portuguese: A QA-driven Approach using ACE-2005 | Aug 29, 2024 | Event ExtractionInformation Retrieval | —Unverified | 0 |
| Interactive DualChecker for Mitigating Hallucinations in Distilling Large Language Models | Aug 22, 2024 | In-Context LearningKnowledge Distillation | —Unverified | 0 |
| Acquiring Bidirectionality via Large and Small Language Models | Aug 19, 2024 | Few-Shot Learningnamed-entity-recognition | CodeCode Available | 0 |
| MemeMind at ArAIEval Shared Task: Spotting Persuasive Spans in Arabic Text with Persuasion Techniques Identification | Aug 8, 2024 | Propaganda detectiontoken-classification | CodeCode Available | 0 |
| Leveraging Encoder-only Large Language Models for Mobile App Review Feature Extraction | Aug 2, 2024 | Sentiment Analysistoken-classification | CodeCode Available | 0 |
| Looks can be Deceptive: Distinguishing Repetition Disfluency from Reduplication | Jul 11, 2024 | token-classificationToken Classification | —Unverified | 0 |
| Deep Content Understanding Toward Entity and Aspect Target Sentiment Analysis on Foundation Models | Jul 4, 2024 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 0 |
| In-Context Learning on a Budget: A Case Study in Token Classification | Jun 19, 2024 | Domain AdaptationIn-Context Learning | —Unverified | 0 |
| BEADs: Bias Evaluation Across Domains | Jun 6, 2024 | BenchmarkingBias Detection | —Unverified | 0 |
| A Framework for Leveraging Partially-Labeled Data for Product Attribute-Value Identification | May 17, 2024 | AttributeNER | —Unverified | 0 |
| What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs | Apr 10, 2024 | In-Context Learningtoken-classification | CodeCode Available | 0 |
| Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis | Apr 9, 2024 | Cross-Lingual TransferEvent Extraction | —Unverified | 0 |
| TM-TREK at SemEval-2024 Task 8: Towards LLM-Based Automatic Boundary Detection for Human-Machine Mixed Text | Apr 1, 2024 | Boundary DetectionText Detection | —Unverified | 0 |
| Evaluating Shortest Edit Script Methods for Contextual Lemmatization | Mar 25, 2024 | LEMMALemmatization | CodeCode Available | 0 |
| LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression | Mar 19, 2024 | GSM8KLanguage Modelling | CodeCode Available | 9 |
| Evaluating Named Entity Recognition: A comparative analysis of mono- and multilingual transformer models on a novel Brazilian corporate earnings call transcripts dataset | Mar 18, 2024 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 |
| Embedded Named Entity Recognition using Probing Classifiers | Mar 18, 2024 | DecoderFact Checking | CodeCode Available | 0 |
| Ignore Me But Don't Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain | Mar 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysisnamed-entity-recognition | CodeCode Available | 2 |
| Instruction Fine-Tuning: Does Prompt Loss Matter? | Jan 24, 2024 | Multiple-choicetoken-classification | —Unverified | 0 |
| Arabic Text Diacritization In The Age Of Transfer Learning: Token Classification Is All You Need | Jan 9, 2024 | AllArabic Text Diacritization | —Unverified | 0 |
| Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction | Dec 18, 2023 | Entity EmbeddingsLanguage Modeling | CodeCode Available | 0 |
| Lazy-k: Decoding for Constrained Token Classification | Dec 6, 2023 | ClassificationStructured Prediction | CodeCode Available | 0 |
| Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction | Oct 17, 2023 | Entity LinkingKey Information Extraction | CodeCode Available | 1 |
| One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer | Oct 16, 2023 | AllCross-Lingual Transfer | CodeCode Available | 0 |
| Label Supervised LLaMA Finetuning | Oct 2, 2023 | GPUnamed-entity-recognition | CodeCode Available | 1 |
| Revisiting Supertagging for Faster HPSG Pasing | Sep 14, 2023 | token-classificationToken Classification | —Unverified | 0 |
| Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence | Aug 7, 2023 | token-classificationToken Classification | —Unverified | 0 |
| NBIAS: A Natural Language Processing Framework for Bias Identification in Text | Aug 3, 2023 | token-classificationToken Classification | —Unverified | 0 |
| Multimodal Document Analytics for Banking Process Automation | Jul 21, 2023 | token-classificationToken Classification | —Unverified | 0 |
| Retrieval Augmented Generation using Engineering Design Knowledge | Jul 13, 2023 | Common Sense ReasoningEdge Classification | CodeCode Available | 0 |