SOTAVerified

Referring Expression Comprehension

Papers

Showing 5175 of 167 papers

TitleStatusHype
Coarse-to-Fine Vision-Language Pre-training with Fusion in the BackboneCode1
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language ModelsCode1
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression ComprehensionCode1
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression ComprehensionCode1
SeqTR: A Simple yet Universal Network for Visual GroundingCode1
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point CloudsCode1
Referring Transformer: A One-step Approach to Multi-task Visual GroundingCode1
MDETR -- Modulated Detection for End-to-End Multi-Modal UnderstandingCode1
TransVG: End-to-End Visual Grounding with TransformersCode1
Unifying Vision-and-Language Tasks via Text GenerationCode1
TRAR: Routing the Attention Spans in Transformer for Visual Question AnsweringCode1
Large-Scale Adversarial Training for Vision-and-Language Representation LearningCode1
Multi-task Collaborative Network for Joint Referring Expression Comprehension and SegmentationCode1
UNITER: UNiversal Image-TExt Representation LearningCode1
Talk2Car: Taking Control of Your Self-Driving CarCode1
VL-BERT: Pre-training of Generic Visual-Linguistic RepresentationsCode1
A Fast and Accurate One-Stage Approach to Visual GroundingCode1
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language TasksCode1
Explainable Neural Computation via Stack Neural Module NetworksCode1
Compositional Attention Networks for Machine ReasoningCode1
Referring Expression Instance Retrieval and A Strong End-to-End Baseline0
Synthetic Visual Genome0
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and SegmentationCode0
Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding0
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.