SOTAVerified

TextVQA

Papers

Showing 2647 of 47 papers

TitleStatusHype
CogVLM: Visual Expert for Pretrained Language ModelsCode5
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA0
Sentence Attention Blocks for Answer Grounding0
Separate and Locate: Rethink the Text in Text-based Visual Question AnsweringCode0
Making the V in Text-VQA Matter0
Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA0
SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering0
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering0
TAG: Boosting Text-VQA via Text-aware Visual Question-answer GenerationCode1
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering0
LaTr: Layout-Aware Transformer for Scene-Text VQACode1
Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture0
Winner Team Mia at TextVQA Challenge 2021: Vision-and-Language Representation Learning with Pre-trained Sequence-to-Sequence Model0
TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text0
A First Look: Towards Explainable TextVQA Models via Visual and Textual ExplanationsCode1
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCapsCode0
TAP: Text-Aware Pre-training for Text-VQA and Text-CaptionCode1
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question AnsweringCode1
Spatially Aware Multimodal Transformers for TextVQACode1
Structured Multimodal Attentions for TextVQACode1
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQACode0
Towards VQA Models That Can ReadCode3
Show:102550
← PrevPage 2 of 2Next →

No leaderboard results yet.