Referring Image Segmentation Using Text Supervision Aug 28, 2023 Image Segmentation Object Localization
Code Code Available 1Spectrum-guided Multi-granularity Referring Video Object Segmentation Jul 25, 2023 Object Referring Expression Segmentation
Code Code Available 1Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation Jul 21, 2023 Decoder Image Segmentation
Code Code Available 1OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation Jul 18, 2023 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation Jun 14, 2023 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation May 26, 2023 cross-modal alignment Object
Code Code Available 1Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation May 25, 2023 Object Referring Expression Segmentation
Code Code Available 1Advancing Referring Expression Segmentation Beyond Single Image May 21, 2023 Co-Salient Object Detection Object
Code Code Available 1Zero-shot Referring Image Segmentation with Global-Local Context Features Mar 31, 2023 Image Segmentation Referring Expression
Code Code Available 1PolyFormer: Referring Image Segmentation as Sequential Polygon Generation Feb 14, 2023 Decoder Image Segmentation
Code Code Available 1Multi-Attention Network for Compressed Video Referring Object Segmentation Jul 26, 2022 Object Referring Expression Segmentation
Code Code Available 1Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus Jul 4, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation Apr 6, 2022 Optical Flow Estimation Referring Expression Segmentation
Code Code Available 1SeqTR: A Simple yet Universal Network for Visual Grounding Mar 30, 2022 Decoder Referring Expression
Code Code Available 1Local-Global Context Aware Transformer for Language-Guided Video Segmentation Mar 18, 2022 Referring Expression Segmentation Referring Video Object Segmentation
Code Code Available 1Image Segmentation Using Text and Image Prompts Dec 18, 2021 Decoder Image Segmentation
Code Code Available 1LAVT: Language-Aware Vision Transformer for Referring Image Segmentation Dec 4, 2021 Decoder Generalized Referring Expression Segmentation
Code Code Available 1CRIS: CLIP-Driven Referring Image Segmentation Nov 30, 2021 Contrastive Learning Decoder
Code Code Available 1End-to-End Referring Video Object Segmentation with Multimodal Transformers Nov 29, 2021 Inductive Bias Instance Segmentation
Code Code Available 1Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts Nov 16, 2021 Cross-Modal Retrieval Image Captioning
Code Code Available 1Vision-Language Transformer and Query Generation for Referring Segmentation Aug 12, 2021 Decoder Generalized Referring Expression Comprehension
Code Code Available 1SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation Jun 8, 2021 Object object-detection
Code Code Available 1Referring Transformer: A One-step Approach to Multi-task Visual Grounding Jun 6, 2021 Decoder Referring Expression
Code Code Available 1Cross-Modal Progressive Comprehension for Referring Segmentation May 15, 2021 Attribute Image Segmentation
Code Code Available 1MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding Apr 26, 2021 Generalized Referring Expression Comprehension Phrase Grounding
Code Code Available 1OCID-Ref: A 3D Robotic Dataset with Embodied Language for Clutter Scene Grounding Mar 13, 2021 Referring Expression Referring Expression Segmentation
Code Code Available 1RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation Oct 1, 2020 Image Segmentation Referring Expression Segmentation
Code Code Available 1Referring Image Segmentation via Cross-Modal Progressive Comprehension Oct 1, 2020 Attribute Image Segmentation
Code Code Available 1PhraseCut: Language-based Image Segmentation in the Wild Aug 3, 2020 Attribute Diversity
Code Code Available 1URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark Aug 1, 2020 Object One-shot visual object segmentation
Code Code Available 1Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation Mar 19, 2020 Generalized Referring Expression Comprehension Referring Expression
Code Code Available 1Actor and Action Video Segmentation from a Sentence Mar 20, 2018 Action Segmentation Decoder
Code Code Available 1Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval Jun 28, 2025 Cross-Modal Retrieval Image Captioning
— Unverified 0Refer to Anything with Vision-Language Prompts Jun 5, 2025 Benchmarking Generalized Referring Expression Segmentation
— Unverified 0RESAnything: Attribute Prompting for Arbitrary Referring Segmentation May 3, 2025 Attribute Image Segmentation
— Unverified 03DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation Apr 17, 2025 Referring Expression Referring Expression Segmentation
— Unverified 0Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities Apr 2, 2025 Descriptive Large Language Model
Code Code Available 0ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations Jan 24, 2025 Decoder Object
— Unverified 0InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Jan 21, 2025 Object Tracking Referring Expression Segmentation
Code Code Available 0Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension Jan 2, 2025 Generalized Referring Expression Comprehension Generalized Referring Expression Segmentation
— Unverified 0Task-aware Cross-modal Feature Refinement Transformer with Large Language Models for Visual Grounding Jan 1, 2025 Referring Expression Referring Expression Comprehension
— Unverified 0DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension Jan 1, 2025 Descriptive Referring Expression
— Unverified 0Instance-Aware Generalized Referring Expression Segmentation Nov 22, 2024 Generalized Referring Expression Segmentation Object
— Unverified 0SegLLM: Multi-round Reasoning Segmentation Oct 24, 2024 Reasoning Segmentation Referring Expression
— Unverified 0SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation Jul 2, 2024 Referring Expression Referring Expression Segmentation
— Unverified 0GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation Jun 18, 2024 Contrastive Learning Object
— Unverified 0GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane May 27, 2024 3DGS feature selection
— Unverified 0Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation May 24, 2024 Decoder Generalized Referring Expression Segmentation
Code Code Available 0Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation May 17, 2024 Referring Expression Segmentation Referring Video Object Segmentation
— Unverified 0Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding Apr 12, 2024 Decoder Image Segmentation
Code Code Available 0