SOTAVerified

TextVQA

Papers

Showing 2130 of 47 papers

TitleStatusHype
Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language ModelsCode0
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQACode0
Towards a Unified Multimodal Reasoning FrameworkCode0
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCapsCode0
Separate and Locate: Rethink the Text in Text-based Visual Question AnsweringCode0
Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal CluesCode0
VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction OptimizationCode0
InstructOCR: Instruction Boosting Scene Text SpottingCode0
Winner Team Mia at TextVQA Challenge 2021: Vision-and-Language Representation Learning with Pre-trained Sequence-to-Sequence Model0
Analysing the Robustness of Vision-Language-Models to Common Corruptions0
Show:102550
← PrevPage 3 of 5Next →

No leaderboard results yet.