VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners Dec 9, 2022 Question Answering Retrieval
— Unverified 0Vi-MIX FOR SELF-SUPERVISED VIDEO REPRESENTATION Sep 29, 2021 Action Recognition Representation Learning
— Unverified 0ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation Oct 11, 2021 Moment Retrieval Retrieval
— Unverified 0Visual Information Retrieval in Endoscopic Video Archives Apr 29, 2015 Information Retrieval Retrieval
— Unverified 0Visual Semantic Search: Retrieving Videos via Complex Textual Queries Jun 1, 2014 Autonomous Driving Natural Language Queries
— Unverified 0VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending May 22, 2023 Question Answering Retrieval
— Unverified 0VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding May 20, 2021 Action Segmentation Language Modeling
— Unverified 0VRAG: Region Attention Graphs for Content-Based Video Retrieval May 18, 2022 Retrieval Video Retrieval
— Unverified 0VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products Dec 10, 2015 Re-Ranking Retrieval
— Unverified 0VScript: Controllable Script Generation with Visual Presentation Mar 1, 2022 Dialogue Generation Retrieval
— Unverified 0Watch Less and Uncover More: Could Navigation Tools Help Users Search and Explore Videos? Jan 10, 2022 Information Retrieval Retrieval
— Unverified 0AMIL: Adversarial Multi Instance Learning for Human Pose Estimation Mar 18, 2020 Multiple Instance Learning Pose Estimation
Code Code Available 0Self-supervised Video Representation Learning by Context and Motion Decoupling Apr 2, 2021 Action Recognition CPU
Code Code Available 0LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers Jun 1, 2018 Copy Detection Retrieval
Code Code Available 0Joint Searching and Grounding: Multi-Granularity Video Content Retrieval Oct 23, 2023 Contrastive Learning Retrieval
Code Code Available 0Self-supervised Video Representation Learning with Cascade Positive Retrieval Jan 20, 2022 Action Recognition Contrastive Learning
Code Code Available 0Dialogue-to-Video Retrieval Mar 23, 2023 Recommendation Systems Retrieval
Code Code Available 0Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video Mar 5, 2020 Action Recognition Representation Learning
Code Code Available 0Is Multimodal Vision Supervision Beneficial to Language? Feb 10, 2023 Image Retrieval Natural Language Understanding
Code Code Available 0Semantic Role Aware Correlation Transformer for Text to Video Retrieval Jun 26, 2022 Retrieval Text to Video Retrieval
Code Code Available 0A Challenge to Build Neuro-Symbolic Video Agents May 20, 2025 Scene Classification Video Retrieval
Code Code Available 0Deep Hashing with Category Mask for Fast Video Retrieval Dec 22, 2017 Code Generation Deep Hashing
Code Code Available 0Improving Video Corpus Moment Retrieval with Partial Relevance Enhancement Feb 21, 2024 Moment Retrieval Retrieval
Code Code Available 0SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval Jul 23, 2024 Retrieval Sign Language Retrieval
Code Code Available 0ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models Jun 28, 2023 Retrieval Video Retrieval
Code Code Available 0Inter-intra Variant Dual Representations forSelf-supervised Video Recognition Jul 2, 2021 Contrastive Learning Representation Learning
Code Code Available 0SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries Nov 24, 2020 Ad-hoc video search Management
Code Code Available 0Screencast Tutorial Video Understanding Jun 1, 2020 object-detection Object Detection
Code Code Available 0Rudder: A Cross Lingual Video and Text Retrieval Dataset Mar 9, 2021 Natural Language Queries Retrieval
Code Code Available 0ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval Oct 9, 2022 Retrieval Sentence
Code Code Available 0RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval Jun 26, 2022 Mixture-of-Experts Retrieval
Code Code Available 0Hashing with Mutual Information Mar 2, 2018 Image Retrieval Retrieval
Code Code Available 0Accommodating Audio Modality in CLIP for Multimodal Processing Mar 12, 2023 AudioCaps Contrastive Learning
Code Code Available 0ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams Apr 21, 2025 Informativeness Low-latency processing
Code Code Available 0Video-Text Retrieval by Supervised Sparse Multi-Grained Learning Feb 19, 2023 Representation Learning Retrieval
Code Code Available 0Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language Apr 1, 2022 Diversity Image Captioning
Code Code Available 0Graph Based Temporal Aggregation for Video Retrieval Nov 4, 2020 Retrieval Video Retrieval
Code Code Available 0Contextual Explainable Video Representation: Human Perception-based Understanding Dec 12, 2022 Action Detection Action Recognition
Code Code Available 0You were saying? - Spoken Language in the V3C Dataset Dec 15, 2022 Retrieval Video Retrieval
Code Code Available 0GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning Jul 20, 2022 Action Recognition Clustering
Code Code Available 0Unmasked Teacher: Towards Training-Efficient Video Foundation Models Mar 28, 2023 Action Classification Action Recognition
Code Code Available 0Relevance-based Margin for Contrastively-trained Video Retrieval Models Apr 27, 2022 Multi-Instance Retrieval Natural Language Queries
Code Code Available 0ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising Oct 29, 2024 Retrieval Text to Video Retrieval
Code Code Available 0Contrastive Alignment with Semantic Gap-Aware Corrections in Text-Video Retrieval May 18, 2025 Contrastive Learning Retrieval
Code Code Available 0Circulant temporal encoding for video retrieval and temporal alignment Jun 8, 2015 Retrieval Video Retrieval
Code Code Available 0Aligning Step-by-Step Instructional Diagrams to Video Demonstrations Mar 24, 2023 Contrastive Learning Image Retrieval
Code Code Available 0Generating Signed Language Instructions in Large-Scale Dialogue Systems Oct 17, 2024 Retrieval Text Generation
Code Code Available 0Central Similarity Quantization for Efficient Image and Video Retrieval Aug 1, 2019 Quantization Retrieval
Code Code Available 0From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos Jun 5, 2025 Action Classification Composed Video Retrieval (CoVR)
Code Code Available 0FIVR: Fine-grained Incident Video Retrieval Sep 11, 2018 Benchmarking Retrieval
Code Code Available 0