| PubTables-1M: Towards comprehensive table extraction from unstructured documents | Sep 30, 2021 | Articlesobject-detection | CodeCode Available | 2 | 5 |
| Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents | Aug 23, 2022 | Optical Character Recognition (OCR)Table Extraction | CodeCode Available | 1 | 5 |
| H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables | Jun 29, 2024 | Fact VerificationMathematical Reasoning | CodeCode Available | 1 | 5 |
| SIMPLOT: Enhancing Chart Question Answering by Distilling Essentials | Feb 22, 2024 | Chart Question AnsweringLanguage Modeling | CodeCode Available | 1 | 5 |
| Deep learning for table detection and structure recognition: A survey | Nov 15, 2022 | Deep Learningobject-detection | CodeCode Available | 1 | 5 |
| GFTE: Graph-based Financial Table Extraction | Mar 17, 2020 | Information RetrievalPosition | CodeCode Available | 1 | 5 |
| Schema-Driven Information Extraction from Heterogeneous Tables | May 23, 2023 | Attribute ExtractionInstruction Following | CodeCode Available | 1 | 5 |
| TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images | Jan 6, 2020 | Table DetectionTable Extraction | CodeCode Available | 1 | 5 |
| CTE: A Dataset for Contextualized Table Extraction | Feb 2, 2023 | Document Layout AnalysisTable Detection | CodeCode Available | 1 | 5 |
| Flexible Table Recognition and Semantic Interpretation System | May 25, 2021 | Table DetectionTable Extraction | CodeCode Available | 0 | 5 |
| ScanBank: A Benchmark Dataset for Figure Extraction from Scanned Electronic Theses and Dissertations | Jun 23, 2021 | Data AugmentationTable Extraction | CodeCode Available | 0 | 5 |
| SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction | Dec 5, 2024 | ArticlesDataset Generation | CodeCode Available | 0 | 5 |
| QUEST: Quality-aware Semi-supervised Table Extraction for Business Documents | Jun 17, 2025 | Pseudo LabelTable Extraction | CodeCode Available | 0 | 5 |
| DiSCoMaT: Distantly Supervised Composition Extraction from Tables in Materials Science Articles | Jul 3, 2022 | ArticlesNutrition | CodeCode Available | 0 | 5 |
| PdfTable: A Unified Toolkit for Deep Learning-Based Table Extraction | Sep 8, 2024 | Deep LearningDocument Layout Analysis | CodeCode Available | 0 | 5 |
| Web Table Classification based on Visual Features | Feb 25, 2021 | ClassificationGeneral Classification | —Unverified | 0 | 0 |
| A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents | Mar 17, 2023 | RetrievalTable Extraction | —Unverified | 0 | 0 |
| Web Table Extraction, Retrieval and Augmentation: A Survey | Feb 1, 2020 | Question AnsweringRetrieval | —Unverified | 0 | 0 |
| An Annotated Corpus and Method for Analysis of Ad-Hoc Structures Embedded in Text | May 1, 2016 | Table ExtractionText Segmentation | —Unverified | 0 | 0 |
| A two-stage approach for table extraction in invoices | Oct 10, 2022 | Table ExtractionVocal Bursts Valence Prediction | —Unverified | 0 | 0 |
| ChartCitor: Multi-Agent Framework for Fine-Grained Chart Visual Attribution | Feb 3, 2025 | Chart Question AnsweringQuestion Answering | —Unverified | 0 | 0 |
| CISOL: An Open and Extensible Dataset for Table Structure Recognition in the Construction Industry | Jan 26, 2025 | BenchmarkingObject Detection | —Unverified | 0 | 0 |
| Design and Implementation of an OCR-Powered Pipeline for Table Extraction from Invoices | Jul 9, 2025 | Boundary DetectionOptical Character Recognition (OCR) | —Unverified | 0 | 0 |
| Financial Table Extraction in Image Documents | Mar 18, 2024 | Image SegmentationOptical Character Recognition (OCR) | —Unverified | 0 | 0 |
| Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context | May 1, 2020 | Cell Detectionobject-detection | —Unverified | 0 | 0 |
| Modelling the semantics of text in complex document layouts using graph transformer networks | Feb 18, 2022 | Table Extraction | —Unverified | 0 | 0 |
| PlotEdit: Natural Language-Driven Accessible Chart Editing in PDFs via Multimodal LLM Agents | Jan 20, 2025 | AttributeTable Extraction | —Unverified | 0 | 0 |
| RAPTOR: Refined Approach for Product Table Object Recognition | Feb 19, 2025 | ObjectObject Recognition | —Unverified | 0 | 0 |
| TableLab: An Interactive Table Extraction System with Adaptive Deep Learning | Feb 16, 2021 | Deep LearningTable Extraction | —Unverified | 0 | 0 |
| TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables | May 12, 2021 | ArticlesTable Extraction | —Unverified | 0 | 0 |
| Tablext: A Combined Neural Network And Heuristic Based Table Extractor | Apr 22, 2021 | object-detectionObject Detection | —Unverified | 0 | 0 |
| tabulapdf: An R Package to Extract Tables from PDF Documents | Aug 25, 2024 | RetrievalTable Extraction | —Unverified | 0 | 0 |