| LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression | Mar 19, 2024 | GSM8KLanguage Modelling | CodeCode Available | 9 | 5 |
| AutoTrain: No-code training for state-of-the-art models | Oct 21, 2024 | Classificationimage-classification | CodeCode Available | 7 | 5 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8kGPU | CodeCode Available | 4 | 5 |
| Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation | Feb 27, 2025 | Image Generationtoken-classification | CodeCode Available | 3 | 5 |
| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysisnamed-entity-recognition | CodeCode Available | 2 | 5 |
| VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups | Jun 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Empowering the Fact-checkers! Automatic Identification of Claim Spans on Twitter | Oct 10, 2022 | Misinformationtoken-classification | CodeCode Available | 1 | 5 |
| Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction | Oct 17, 2023 | Entity LinkingKey Information Extraction | CodeCode Available | 1 | 5 |
| Ultrasound Video Transformers for Cardiac Ejection Fraction Estimation | Jul 2, 2021 | token-classificationToken Classification | CodeCode Available | 1 | 5 |
| Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking | Mar 11, 2020 | Entity DisambiguationEntity Linking | CodeCode Available | 1 | 5 |
| WangchanBERTa: Pretraining transformer-based Thai Language Models | Jan 24, 2021 | ArticlesLanguage Modelling | CodeCode Available | 1 | 5 |
| NLRG at SemEval-2021 Task 5: Toxic Spans Detection Leveraging BERT-based Token Classification and Span Prediction Techniques | Feb 24, 2021 | NERtoken-classification | CodeCode Available | 1 | 5 |
| BERT got a Date: Introducing Transformers to Temporal Tagging | Sep 30, 2021 | ClassificationDecoder | CodeCode Available | 1 | 5 |
| From Zero to Hero: Harnessing Transformers for Biomedical Named Entity Recognition in Zero- and Few-shot Contexts | May 5, 2023 | few-shot-nerFew-shot NER | CodeCode Available | 1 | 5 |
| On Long-Tailed Phenomena in Neural Machine Translation | Oct 10, 2020 | Conditional Text GenerationMachine Translation | CodeCode Available | 1 | 5 |
| Improving Radiology Report Generation Systems by Removing Hallucinated References to Non-existent Priors | Sep 27, 2022 | token-classificationToken Classification | CodeCode Available | 1 | 5 |
| Label Supervised LLaMA Finetuning | Oct 2, 2023 | GPUnamed-entity-recognition | CodeCode Available | 1 | 5 |
| General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining | Mar 21, 2022 | Domain Adaptationtoken-classification | CodeCode Available | 1 | 5 |
| The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks | May 15, 2025 | Cross-Lingual Transfertoken-classification | CodeCode Available | 0 | 5 |
| ChuLo: Chunk-Level Key Information Representation for Long Document Processing | Oct 14, 2024 | ChunkingClassification | CodeCode Available | 0 | 5 |
| Leveraging Encoder-only Large Language Models for Mobile App Review Feature Extraction | Aug 2, 2024 | Sentiment Analysistoken-classification | CodeCode Available | 0 | 5 |
| Retrieval Augmented Generation using Engineering Design Knowledge | Jul 13, 2023 | Common Sense ReasoningEdge Classification | CodeCode Available | 0 | 5 |
| Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages | Nov 4, 2020 | ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| Domino at FinCausal 2020, Task 1 and 2: Causal Extraction System | Dec 1, 2020 | Information RetrievalQuestion Answering | CodeCode Available | 0 | 5 |
| ClassBases at CASE-2022 Multilingual Protest Event Detection Tasks: Multilingual Protest News Detection and Automatically Replicating Manually Created Event Datasets | Jan 16, 2023 | ClassificationDocument Classification | CodeCode Available | 0 | 5 |
| UoB at SemEval-2021 Task 5: Extending Pre-Trained Language Models to Include Task and Domain-Specific Information for Toxic Span Prediction | Oct 7, 2021 | token-classificationToken Classification | CodeCode Available | 0 | 5 |
| What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs | Apr 10, 2024 | In-Context Learningtoken-classification | CodeCode Available | 0 | 5 |
| Embedded Named Entity Recognition using Probing Classifiers | Mar 18, 2024 | DecoderFact Checking | CodeCode Available | 0 | 5 |
| YoungSheldon at SemEval-2021 Task 5: Fine-tuning Pre-trained Language Models for Toxic Spans Detection using Token classification Objective | Aug 1, 2021 | Sentencetoken-classification | CodeCode Available | 0 | 5 |
| Lazy-k: Decoding for Constrained Token Classification | Dec 6, 2023 | ClassificationStructured Prediction | CodeCode Available | 0 | 5 |
| Entity at SemEval-2021 Task 5: Weakly Supervised Token Labelling for Toxic Spans Detection | Aug 1, 2021 | ClassificationLanguage Modeling | CodeCode Available | 0 | 5 |
| Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction | Dec 18, 2023 | Entity EmbeddingsLanguage Modeling | CodeCode Available | 0 | 5 |
| MemeMind at ArAIEval Shared Task: Spotting Persuasive Spans in Arabic Text with Persuasion Techniques Identification | Aug 8, 2024 | Propaganda detectiontoken-classification | CodeCode Available | 0 | 5 |
| Evaluating Named Entity Recognition: A comparative analysis of mono- and multilingual transformer models on a novel Brazilian corporate earnings call transcripts dataset | Mar 18, 2024 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 | 5 |
| Evaluating Shortest Edit Script Methods for Contextual Lemmatization | Mar 25, 2024 | LEMMALemmatization | CodeCode Available | 0 | 5 |
| NamedEntityRangers at SemEval-2022 Task 11: Transformer-based Approaches for Multilingual Complex Named Entity Recognition | Jul 1, 2022 | Decodernamed-entity-recognition | CodeCode Available | 0 | 5 |
| Common-Knowledge Concept Recognition for SEVA | Mar 26, 2020 | Entity Extraction using GANgraph construction | CodeCode Available | 0 | 5 |
| One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer | Oct 16, 2023 | AllCross-Lingual Transfer | CodeCode Available | 0 | 5 |
| Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging | May 26, 2023 | Cross-Lingual TransferModel Selection | CodeCode Available | 0 | 5 |
| POS-tagging to highlight the skeletal structure of sentences | Nov 21, 2024 | Machine TranslationMorphological Analysis | CodeCode Available | 0 | 5 |
| Counterfactual Detection meets Transfer Learning | May 27, 2020 | Binary Classificationcounterfactual | CodeCode Available | 0 | 5 |
| Acquiring Bidirectionality via Large and Small Language Models | Aug 19, 2024 | Few-Shot Learningnamed-entity-recognition | CodeCode Available | 0 | 5 |
| GUS-Net: Social Bias Classification in Text with Generalizations, Unfairness, and Stereotypes | Oct 10, 2024 | Bias Detectiontoken-classification | CodeCode Available | 0 | 5 |
| Deep Content Understanding Toward Entity and Aspect Target Sentiment Analysis on Foundation Models | Jul 4, 2024 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 0 | 5 |
| Technical Report: Impact of Position Bias on Language Models in Token Classification | Apr 26, 2023 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 | 5 |
| Detecting Label Errors in Token Classification Data | Oct 8, 2022 | General ClassificationToken Classification | CodeCode Available | 0 | 5 |
| TACO-RL: Task Aware Prompt Compression Optimization with Reinforcement Learning | Sep 19, 2024 | Code SummarizationComputational Efficiency | —Unverified | 0 | 0 |
| The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts | Aug 31, 2024 | document understandingtoken-classification | —Unverified | 0 | 0 |
| Token Classification for Disambiguating Medical Abbreviations | Oct 5, 2022 | Classificationtext-classification | —Unverified | 0 | 0 |
| Tradeoffs in Resampling and Filtering for Imbalanced Classification | Aug 31, 2022 | Classificationimbalanced classification | —Unverified | 0 | 0 |