| LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression | Mar 19, 2024 | GSM8KLanguage Modelling | CodeCode Available | 9 |
| AutoTrain: No-code training for state-of-the-art models | Oct 21, 2024 | Classificationimage-classification | CodeCode Available | 7 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8kGPU | CodeCode Available | 4 |
| Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation | Feb 27, 2025 | Image Generationtoken-classification | CodeCode Available | 3 |
| VNLP: Turkish NLP Package | Mar 2, 2024 | Morphological Analysisnamed-entity-recognition | CodeCode Available | 2 |
| Empowering the Fact-checkers! Automatic Identification of Claim Spans on Twitter | Oct 10, 2022 | Misinformationtoken-classification | CodeCode Available | 1 |
| On Long-Tailed Phenomena in Neural Machine Translation | Oct 10, 2020 | Conditional Text GenerationMachine Translation | CodeCode Available | 1 |
| From Zero to Hero: Harnessing Transformers for Biomedical Named Entity Recognition in Zero- and Few-shot Contexts | May 5, 2023 | few-shot-nerFew-shot NER | CodeCode Available | 1 |
| Reading Order Matters: Information Extraction from Visually-rich Documents by Token Path Prediction | Oct 17, 2023 | Entity LinkingKey Information Extraction | CodeCode Available | 1 |
| NLRG at SemEval-2021 Task 5: Toxic Spans Detection Leveraging BERT-based Token Classification and Span Prediction Techniques | Feb 24, 2021 | NERtoken-classification | CodeCode Available | 1 |
| WangchanBERTa: Pretraining transformer-based Thai Language Models | Jan 24, 2021 | ArticlesLanguage Modelling | CodeCode Available | 1 |
| Ultrasound Video Transformers for Cardiac Ejection Fraction Estimation | Jul 2, 2021 | token-classificationToken Classification | CodeCode Available | 1 |
| General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining | Mar 21, 2022 | Domain Adaptationtoken-classification | CodeCode Available | 1 |
| BERT got a Date: Introducing Transformers to Temporal Tagging | Sep 30, 2021 | ClassificationDecoder | CodeCode Available | 1 |
| VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups | Jun 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Improving Radiology Report Generation Systems by Removing Hallucinated References to Non-existent Priors | Sep 27, 2022 | token-classificationToken Classification | CodeCode Available | 1 |
| Investigating Entity Knowledge in BERT with Simple Neural End-To-End Entity Linking | Mar 11, 2020 | Entity DisambiguationEntity Linking | CodeCode Available | 1 |
| Label Supervised LLaMA Finetuning | Oct 2, 2023 | GPUnamed-entity-recognition | CodeCode Available | 1 |
| Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis | Apr 9, 2024 | Cross-Lingual TransferEvent Extraction | —Unverified | 0 |
| Bangla Grammatical Error Detection Leveraging Transformer-based Token Classification | Nov 13, 2024 | Grammatical Error Detectiontoken-classification | —Unverified | 0 |
| Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation | Oct 1, 2024 | DescriptiveInductive Bias | —Unverified | 0 |
| Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence | Aug 7, 2023 | token-classificationToken Classification | —Unverified | 0 |
| BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation | Oct 13, 2024 | Natural Language Understandingparameter-efficient fine-tuning | —Unverified | 0 |
| Event Extraction for Portuguese: A QA-driven Approach using ACE-2005 | Aug 29, 2024 | Event ExtractionInformation Retrieval | —Unverified | 0 |
| A Framework for Leveraging Partially-Labeled Data for Product Attribute-Value Identification | May 17, 2024 | AttributeNER | —Unverified | 0 |