SOTAVerified

document understanding

Document understanding involves document classification, layout analysis, information extraction, and DocQA.

Papers

Showing 251275 of 309 papers

TitleStatusHype
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding0
Finding Pragmatic Differences Between Disciplines0
FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction0
FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction0
Friendly Topic Assistant for Transformer Based Abstractive Summarization0
From Entity Linking to Question Answering -- Recent Progress on Semantic Grounding Tasks0
Génération de question à partir d’analyse sémantique pour l’adaptation non supervisée de modèles de compréhension de documents (Question generation from semantic analysis for unsupervised adaptation of document understanding models)0
Graph Convolution for Multimodal Information Extraction from Visually Rich Documents0
Handling tree-structured text: parsing directory pages0
Harnessing Webpage UIs for Text-Rich Visual Understanding0
Hierarchical BERT for Medical Document Understanding0
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models0
Hierarchical Visual Feature Aggregation for OCR-Free Document Understanding0
How does Watermarking Affect Visual Language Models in Document Understanding?0
HRVDA: High-Resolution Visual Document Assistant0
Improving Applicability of Deep Learning based Token Classification models during Training0
Improving Keyphrase Extraction with Data Augmentation and Information Filtering0
Information Extraction from Heterogeneous Documents without Ground Truth Labels using Synthetic Label Generation and Knowledge Distillation0
Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding0
Joint Structured Learning and Predictions under Logical Constraints in Conditional Random Fields0
KeyVec: Key-semantics Preserving Document Representations0
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding0
KOSMOS-2.5: A Multimodal Literate Model0
LAMPRET: Layout-Aware Multimodal PreTraining for Document Understanding0
LAPDoc: Layout-Aware Prompting for Documents0
Show:102550
← PrevPage 11 of 13Next →

No leaderboard results yet.