| MultiQG-TI: Towards Question Generation from Multi-modal Sources | Jul 7, 2023 | Image to textOptical Character Recognition | CodeCode Available | 0 |
| T-MARS: Improving Visual Representations by Circumventing Text Feature Learning | Jul 6, 2023 | Optical Character Recognition | CodeCode Available | 1 |
| Resume Information Extraction via Post-OCR Text Processing | Jun 23, 2023 | Object RecognitionOptical Character Recognition | —Unverified | 0 |
| A Survey on Multimodal Large Language Models | Jun 23, 2023 | HallucinationIn-Context Learning | —Unverified | 0 |
| Transformer-Based UNet with Multi-Headed Cross-Attention Skip Connections to Eliminate Artifacts in Scanned Documents | Jun 5, 2023 | DenoisingDocument Classification | —Unverified | 0 |
| TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain | Jun 3, 2023 | BenchmarkingDecoder | CodeCode Available | 1 |
| DuoSearch: A Novel Search Engine for Bulgarian Historical Documents | May 30, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers | May 27, 2023 | Image Super-ResolutionLicense Plate Recognition | CodeCode Available | 1 |
| Exploring Better Text Image Translation with Multimodal Codebook | May 27, 2023 | Machine TranslationOptical Character Recognition | CodeCode Available | 1 |
| Measuring Intersectional Biases in Historical Documents | May 21, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |