| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | May 27, 2022 | 16k4k | CodeCode Available | 6 |
| BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining | Oct 19, 2022 | Document ClassificationLanguage Modelling | CodeCode Available | 4 |
| Pre-Training with Whole Word Masking for Chinese BERT | Jun 19, 2019 | Document ClassificationGeneral Classification | CodeCode Available | 3 |
| DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models | Jun 17, 2024 | Document ClassificationVisual Grounding | CodeCode Available | 3 |
| Visually Guided Generative Text-Layout Pre-training for Document Intelligence | Mar 25, 2024 | Document Classificationdocument understanding | CodeCode Available | 2 |
| LinkBERT: Pretraining Language Models with Document Links | Mar 29, 2022 | Document ClassificationLanguage Modeling | CodeCode Available | 2 |
| One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian Optimization | Feb 15, 2022 | AllBayesian Optimization | CodeCode Available | 2 |
| Document Classification for COVID-19 Literature | Jun 15, 2020 | ArticlesClassification | CodeCode Available | 1 |
| Data Programming by Demonstration: A Framework for Interactively Learning Labeling Functions | Sep 3, 2020 | Document Classification | CodeCode Available | 1 |
| SPECTER: Document-level Representation Learning using Citation-informed Transformers | Apr 15, 2020 | Citation PredictionDocument Classification | CodeCode Available | 1 |
| Clinical-Longformer and Clinical-BigBird: Transformers for long clinical sequences | Jan 27, 2022 | Clinical KnowledgeDocument Classification | CodeCode Available | 1 |
| DocBERT: BERT for Document Classification | Apr 17, 2019 | ClassificationDocument Classification | CodeCode Available | 1 |
| A Comparative Study of Pretrained Language Models for Long Clinical Text | Jan 27, 2023 | Clinical KnowledgeDocument Classification | CodeCode Available | 1 |
| A Corpus for Multilingual Document Classification in Eight Languages | May 24, 2018 | ClassificationCross-Lingual Document Classification | CodeCode Available | 1 |
| Can a Fruit Fly Learn Word Embeddings? | Jan 18, 2021 | Document ClassificationWord Embeddings | CodeCode Available | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Feb 3, 2023 | ArticlesDocument Classification | CodeCode Available | 1 |
| ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths | Jun 12, 2022 | ChunkingDocument Classification | CodeCode Available | 1 |
| Aspect-based Document Similarity for Research Papers | Oct 13, 2020 | Document ClassificationRecommendation Systems | CodeCode Available | 1 |
| Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT | Jul 9, 2021 | BenchmarkingDocument Classification | CodeCode Available | 1 |
| Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT | Apr 19, 2019 | Cross-Lingual NERCross-Lingual Transfer | CodeCode Available | 1 |
| A Sentence-level Hierarchical BERT Model for Document Classification with Limited Labelled Data | Jun 12, 2021 | ClassificationDocument Classification | CodeCode Available | 1 |
| Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning | Oct 13, 2015 | Document ClassificationRepresentation Learning | CodeCode Available | 1 |
| Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution | Sep 10, 2021 | Document ClassificationMulti-Label Text Classification | CodeCode Available | 1 |
| ContraDoc: Understanding Self-Contradictions in Documents with Large Language Models | Nov 15, 2023 | Document ClassificationQuestion Answering | CodeCode Available | 1 |
| Pre-training technique to localize medical BERT and enhance biomedical BERT | May 14, 2020 | Document ClassificationTransfer Learning | CodeCode Available | 1 |