Sound and Visual Representation Learning with Multiple Pretraining Tasks Jan 4, 2022 Incremental Learning Representation Learning
— Unverified 0Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding Nov 30, 2023 Form Video Retrieval
— Unverified 0Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding Mar 28, 2023 Action Localization Action Recognition
— Unverified 0Spatio-temporal Video Re-localization by Warp LSTM May 10, 2019 Retrieval Video Retrieval
— Unverified 0Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics Aug 5, 2024 Retrieval Video Retrieval
— Unverified 0SSAN: Separable Self-Attention Network for Video Representation Learning May 27, 2021 Action Recognition Representation Learning
— Unverified 0STAR-GNN: Spatial-Temporal Video Representation for Content-based Retrieval Aug 15, 2022 Graph Neural Network Representation Learning
— Unverified 0Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning Dec 7, 2021 Action Recognition Representation Learning
— Unverified 0STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training Feb 20, 2023 Language Modelling Object
— Unverified 0Strategies for Searching Video Content with Text Queries or Video Examples Jun 17, 2016 Event Detection Reranking
— Unverified 0Support-set bottlenecks for video-text representation learning Oct 6, 2020 Contrastive Learning Representation Learning
— Unverified 0SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval Oct 1, 2019 Diversity Retrieval
— Unverified 0SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval Nov 10, 2021 Contrastive Learning Cross-Modal Retrieval
— Unverified 0Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets Sep 2, 2024 Video Alignment Video Editing
— Unverified 0System Analysis And Design For Multimedia Retrieval Systems Dec 31, 2013 Retrieval Video Retrieval
— Unverified 0TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment Aug 23, 2021 Action Segmentation Contrastive Learning
— Unverified 0TeachCLIP: Multi-Grained Teaching for Efficient Text-to-Video Retrieval Aug 2, 2023 Retrieval text similarity
— Unverified 0Temporal Contrastive Graph Learning for Video Action Recognition and Retrieval Jan 4, 2021 Action Recognition Contrastive Learning
— Unverified 0Temporal Contrastive Learning with Curriculum Sep 2, 2022 Action Recognition Contrastive Learning
— Unverified 0Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos Sep 1, 2018 Retrieval Video Retrieval
— Unverified 0Temporal Perceiving Video-Language Pre-training Jan 18, 2023 Action Localization Contrastive Learning
— Unverified 0Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval Sep 27, 2022 Cross-Modal Retrieval Retrieval
— Unverified 0Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval Mar 26, 2024 Multimodal Reasoning Retrieval
— Unverified 0Text-Video Retrieval via Variational Multi-Modal Hypergraph Networks Jan 6, 2024 Retrieval Variational Inference
— Unverified 0The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval Aug 6, 2020 Retrieval Text Retrieval
— Unverified 0Time-Equivariant Contrastive Video Representation Learning Dec 7, 2021 Action Recognition Contrastive Learning
— Unverified 0Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention Sep 17, 2023 Action Recognition Graph Generation
— Unverified 0Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking Apr 11, 2025 Moment Retrieval Question Answering
— Unverified 0Towards Holistic Language-video Representation: the language model-enhanced MSR-Video to Text Dataset Jun 19, 2024 Language Modeling Language Modelling
— Unverified 0TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba Feb 21, 2025 image-classification Image Classification
— Unverified 0TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval Sep 21, 2020 Action Detection Activity Detection
— Unverified 0Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval Jul 6, 2020 Retrieval Video Retrieval
— Unverified 0Tree-based Text-Vision BERT for Video Search in Baidu Video Advertising Sep 19, 2022 Image Retrieval Retrieval
— Unverified 0Two-person interaction detection using body-pose features and multiple instance learning Jul 16, 2012 Activity Recognition Human Activity Recognition
— Unverified 0Uncertainty-aware sign language video retrieval with probability distribution modeling May 30, 2024 Retrieval Sign Language Retrieval
— Unverified 0Unfolding Videos Dynamics via Taylor Expansion Sep 4, 2024 Action Detection Action Recognition
— Unverified 0Unified Embedding and Metric Learning for Zero-Exemplar Event Detection May 5, 2017 Event Detection Metric Learning
— Unverified 0Universal Adversarial Head: Practical Protection against Video Data Leakage Jun 18, 2021 Deep Hashing Retrieval
— Unverified 0Unsupervised Data Uncertainty Learning in Visual Retrieval Systems Feb 7, 2019 Retrieval Triplet
— Unverified 0Unsupervised Segmentation of Action Segments in Egocentric Videos using Gaze Sep 30, 2017 Activity Recognition Retrieval
— Unverified 0Use of Affective Visual Information for Summarization of Human-Centric Videos Jul 8, 2021 Emotion Recognition Retrieval
— Unverified 0V3C - a Research Video Collection Oct 11, 2018 Management Retrieval
— Unverified 0Video 3D Sampling for Self-supervised Representation Learning Jul 8, 2021 Action Recognition Representation Learning
— Unverified 0VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding Sep 28, 2021 Action Localization Action Segmentation
— Unverified 0VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models Oct 1, 2024 Hallucination text similarity
— Unverified 0Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval Mar 24, 2025 Retrieval Text to Video Retrieval
— Unverified 0Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding Sep 29, 2024 Diversity Question Answering
— Unverified 0Video Editing for Video Retrieval Feb 4, 2024 Retrieval Text Retrieval
— Unverified 0Videoprompter: an ensemble of foundational models for zero-shot video understanding Oct 23, 2023 Action Recognition Descriptive
— Unverified 0Video retrieval based on deep convolutional neural network Dec 1, 2017 Retrieval Triplet
— Unverified 0