| ERNIE-Layout: Layout-Knowledge Enhanced Multi-modal Pre-training for Document Understanding | Jan 16, 2022 | cross-modal alignment, Document Classification | Code Available | 0 |
| LoPE: Learnable Sinusoidal Positional Encoding for Improving Document Transformer Model | Jan 16, 2022 | document understanding | Unverified | 0 |
| Efficient layout-aware pretraining for multimodal form understanding | Jan 16, 2022 | document understanding, Form | Unverified | 0 |
| Deeper Clinical Document Understanding Using Relation Extraction | Dec 25, 2021 | document understanding, named-entity-recognition | Code Available | 0 |
| UniDoc: Unified Pretraining Framework for Document Understanding | Dec 1, 2021 | document understanding, Self-Supervised Learning | Unverified | 0 |
| SimCLAD: A Simple Framework for Contrastive Learning of Acronym Disambiguation | Nov 29, 2021 | Contrastive Learning, document understanding | Unverified | 0 |
| PSG: Prompt-based Sequence Generation for Acronym Extraction | Nov 29, 2021 | document understanding, Language Modeling | Unverified | 0 |
| Document Layout Analysis with Aesthetic-Guided Image Augmentation | Nov 27, 2021 | Document Layout Analysis, document understanding | Unverified | 0 |
| Handling tree-structured text: parsing directory pages | Nov 24, 2021 | document understanding | Unverified | 0 |
| Probing Position-Aware Attention Mechanism in Long Document Understanding | Nov 16, 2021 | document understanding, Natural Language Understanding | Unverified | 0 |
| MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding | Oct 16, 2021 | document understanding | Code Available | 0 |
| OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis | Oct 1, 2021 | Active Learning, document understanding | Unverified | 0 |
| Skim-Attention: Learning to Focus via Document Layout | Sep 2, 2021 | document understanding, Language Modeling | Code Available | 0 |
| Position Masking for Improved Layout-Aware Document Understanding | Sep 1, 2021 | document understanding, Position | Unverified | 0 |
| The Law of Large Documents: Understanding the Structure of Legal Contracts Using Visual Cues | Jul 16, 2021 | Attribute, document understanding | Unverified | 0 |
| Leveraging Domain Agnostic and Specific Knowledge for Acronym Disambiguation | Jul 1, 2021 | document understanding, Word Embeddings | Unverified | 0 |
| Document Collection Visual Question Answering | Apr 27, 2021 | document understanding, Question Answering | Unverified | 0 |
| LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding | Apr 18, 2021 | Document Image Classification, document understanding | Code Available | 0 |
| LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding | Apr 16, 2021 | document understanding | Unverified | 0 |
| Automatic Knowledge Extraction with Human Interface | Apr 9, 2021 | document understanding | Unverified | 0 |
| Decontextualization: Making Sentences Stand-Alone | Feb 9, 2021 | document understanding, Question Answering | Unverified | 0 |
| AT-BERT: Adversarial Training BERT for Acronym Identification Winning Solution for SDU@AAAI-21 | Jan 11, 2021 | document understanding, Unsupervised Pre-training | Unverified | 0 |
| BROS: A Pre-trained Language Model for Understanding Texts in Document | Jan 1, 2021 | Decoder, Diversity | Unverified | 0 |
| LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding | Dec 29, 2020 | Document Image Classification, Document Layout Analysis | Code Available | 0 |
| Acronym Identification and Disambiguation Shared Tasks for Scientific Document Understanding | Dec 22, 2020 | document understanding | Unverified | 0 |
| Understood in Translation, Transformers for Domain Understanding | Dec 18, 2020 | document understanding, Translation | Code Available | 0 |
| Primer AI's Systems for Acronym Identification and Disambiguation | Dec 14, 2020 | document understanding, Sentence | Code Available | 0 |
| EvaLDA: Efficient Evasion Attacks Towards Latent Dirichlet Allocation | Dec 9, 2020 | document understanding, Information Retrieval | Code Available | 0 |
| Improving Clinical Document Understanding on COVID-19 Research with Spark NLP | Dec 7, 2020 | Anatomy, Clinical Assertion Status Detection | Code Available | 0 |
| Merge and Recognize: A Geometry and 2D Context Aware Graph Model for Named Entity Recognition from Visual Documents | Dec 1, 2020 | document understanding, Language Modeling | Unverified | 0 |
| A Survey of Deep Learning Approaches for OCR and Document Understanding | Nov 27, 2020 | document understanding, Optical Character Recognition (OCR) | Code Available | 0 |
| WSL-DS: Weakly Supervised Learning with Distant Supervision for Query Focused Multi-Document Abstractive Summarization | Nov 3, 2020 | Abstractive Text Summarization, Document Summarization | Code Available | 0 |
| Friendly Topic Assistant for Transformer Based Abstractive Summarization | Nov 1, 2020 | Abstractive Text Summarization, Document Summarization | Unverified | 0 |
| Attention-Based Graph Neural Network with Global Context Awareness for Document Understanding | Oct 1, 2020 | document understanding, graph construction | Unverified | 0 |
| Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models | Sep 18, 2020 | Decoder, Dialogue Generation | Unverified | 0 |
| Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web | Jul 1, 2020 | document understanding, Entity Linking | Unverified | 0 |
| Scalable Cross Lingual Pivots to Model Pronoun Gender for Translation | Jun 16, 2020 | document understanding, Machine Translation | Unverified | 0 |
| TRIE: End-to-End Text Reading and Information Extraction for Document Understanding | May 27, 2020 | document understanding | Code Available | 0 |
| Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks | Jan 8, 2020 | document understanding, Optical Character Recognition | Unverified | 0 |
| BERT-AL: BERT for Arbitrarily Long Document Understanding | Jan 1, 2020 | document understanding, Text Summarization | Unverified | 0 |
| Table-Of-Contents generation on contemporary documents | Nov 20, 2019 | document understanding | Unverified | 0 |
| Blockwise Self-Attention for Long Document Understanding | Nov 7, 2019 | document understanding, Language Modeling | Code Available | 0 |
| KRED: Knowledge-Aware Document Representation for News Recommendations | Oct 25, 2019 | Articles, document understanding | Code Available | 0 |
| Message Passing Attention Networks for Document Understanding | Aug 17, 2019 | document understanding, Multi-Modal Document Classification | Code Available | 0 |
| Bidirectional Context-Aware Hierarchical Attention Network for Document Understanding | Aug 16, 2019 | Abstractive Text Summarization, document understanding | Code Available | 0 |
| A Retrospective Recount of Computer Architecture Research with a Data-Driven Study of Over Four Decades of ISCA Publications | Jun 22, 2019 | document understanding, Natural Language Understanding | Unverified | 0 |
| A User-Centered Concept Mining System for Query and Document Understanding at Tencent | May 21, 2019 | document understanding, Knowledge Base Construction | Unverified | 0 |
| Graph Convolution for Multimodal Information Extraction from Visually Rich Documents | Mar 27, 2019 | document understanding, Entity Extraction using GAN | Unverified | 0 |
| Doc2Im: document to image conversion through self-attentive embedding | Nov 8, 2018 | Document To Image Conversion, document understanding | Unverified | 0 |
| Chargrid: Towards Understanding 2D Documents | Sep 24, 2018 | Decoder, document understanding | Code Available | 0 |