SOTAVerified

TextVQA

Papers

Showing 41-47 of 47 papers

| Title | Status | Hype |
| --- | --- | --- |
| Towards a Unified Multimodal Reasoning Framework | Code | 0 |
| Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering | Code | 0 |
| InstructOCR: Instruction Boosting Scene Text Spotting | Code | 0 |
| Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps | Code | 0 |
| Separate and Locate: Rethink the Text in Text-based Visual Question Answering | Code | 0 |
| OmniFusion Technical Report | Code | 0 |
| Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA | Code | 0 |

No leaderboard results yet.