| OCRBench: On the Hidden Mystery of OCR in Large Multimodal Models | May 13, 2023 | Key Information ExtractionNutrition | CodeCode Available | 2 |
| E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation | May 9, 2023 | DecoderMachine Translation | CodeCode Available | 0 |
| Evaluating BERT-based Scientific Relation Classifiers for Scholarly Knowledge Graph Construction on Digital Library Collections | May 3, 2023 | graph constructionOptical Character Recognition | —Unverified | 0 |
| DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents | Apr 24, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Multimodal Short Video Rumor Detection System Based on Contrastive Learning | Apr 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TransDocs: Optical Character Recognition with word to word translation | Apr 15, 2023 | Deep LearningDocument Translation | CodeCode Available | 0 |
| Cleansing Jewel: A Neural Spelling Correction Model Built On Google OCR-ed Tibetan Manuscripts | Apr 7, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Linking Representations with Multimodal Contrastive Learning | Apr 7, 2023 | Contrastive LearningOptical Character Recognition | —Unverified | 0 |
| Efficient OCR for Building a Diverse Digital History | Apr 5, 2023 | DiversityImage Retrieval | CodeCode Available | 1 |
| A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision | Mar 30, 2023 | DecoderMulti-Task Learning | —Unverified | 0 |