Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval Aug 15, 2023 Retrieval Video Captioning
Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures Jul 27, 2023 Automatic Speech Recognition Contrastive Learning
Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model Jul 24, 2023 Anomaly Detection Retrieval
An overview on the evaluated video retrieval tasks at TRECVID 2022 Jun 22, 2023 Ad-hoc video search Retrieval
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model Jun 15, 2023 Form model
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment May 20, 2023 Retrieval Video Retrieval
A Large Cross-Modal Video Retrieval Dataset with Reading Comprehension May 5, 2023 Reading Comprehension Retrieval
Robust Cross-Modal Knowledge Distillation for Unconstrained Videos Apr 16, 2023 Action Recognition Audio Tagging
Self-Supervised Video Similarity Learning Apr 6, 2023 ISVR Retrieval
Hierarchical Video-Moment Retrieval and Step-Captioning Mar 29, 2023 Information Retrieval Moment Retrieval
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning Mar 25, 2023 Contrastive Learning Question Answering
MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models Mar 23, 2023 Auxiliary Learning Multimodal Sentiment Analysis
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model Mar 17, 2023 Retrieval Video Retrieval
VVS: Video-to-Video Retrieval with Irrelevant Frame Suppression Mar 15, 2023 Retrieval Video Retrieval
Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring Jan 26, 2023 Representation Learning Retrieval
UATVR: Uncertainty-Adaptive Text-Video Retrieval Jan 16, 2023 Retrieval Semantic correspondence
Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval Jan 1, 2023 Knowledge Distillation Language Modelling
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval Jan 1, 2023 Diversity Object
TempCLR: Temporal Alignment Representation with Contrastive Learning Dec 28, 2022 Action Recognition Contrastive Learning
VindLU: A Recipe for Effective Video-and-Language Pretraining Dec 9, 2022 Question Answering Retrieval
Normalized Contrastive Learning for Text-Video Retrieval Nov 30, 2022 Contrastive Learning Cross-Modal Retrieval
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval Nov 23, 2022 Cross-Modal Retrieval Retrieval
TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision Nov 23, 2022 Retrieval Video Retrieval
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations Nov 21, 2022 Contrastive Learning Representation Learning
Contrastive Masked Autoencoders for Self-Supervised Video Hashing Nov 21, 2022 Decoder Retrieval
Cross-Modal Adapter for Text-Video Retrieval Nov 17, 2022 parameter-efficient fine-tuning Retrieval
3D-CSL: self-supervised 3D context similarity learning for Near-Duplicate Video Retrieval Nov 10, 2022 Retrieval Self-Supervised Learning
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval Oct 7, 2022 Knowledge Distillation Retrieval
TVLT: Textless Vision-Language Transformer Sep 28, 2022 Automatic Speech Recognition (ASR) Image Retrieval
Marine Video Kit: A New Marine Video Dataset for Content-based Analysis and Retrieval Sep 23, 2022 Retrieval Video Retrieval
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling Sep 4, 2022 Fill Mask Optical Flow Estimation
Partially Relevant Video Retrieval Aug 26, 2022 Moment Retrieval Multiple Instance Learning
A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval Aug 3, 2022 Data Augmentation Retrieval
LocVTP: Video-Text Pre-training for Temporal Localization Jul 21, 2022 Retrieval Temporal Localization
Clover: Towards A Unified Video-Language Alignment and Fusion Model Jul 16, 2022 Language Modeling Language Modelling
TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval Jul 16, 2022 Retrieval Video Retrieval
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval Jul 15, 2022 Contrastive Learning Retrieval
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos Jun 25, 2022 Action Classification Clustering
LAVENDER: Unifying Video-Language Understanding as Masked Language Modeling Jun 14, 2022 Decoder Language Modeling
Revisiting the "Video" in Video-Language Understanding Jun 3, 2022 Benchmarking Question Answering
Cross-Architecture Self-supervised Video Representation Learning May 26, 2022 Action Recognition Contrastive Learning
A CLIP-Hitchhiker's Guide to Long Video Retrieval May 17, 2022 Retrieval Video Retrieval
CoCa: Contrastive Captioners are Image-Text Foundation Models May 4, 2022 Action Classification Decoder
TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognition May 4, 2022 Action Recognition Representation Learning
CenterCLIP: Token Clustering for Efficient Text-Video Retrieval May 2, 2022 Clustering Retrieval
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval Apr 26, 2022 Action Recognition Retrieval
ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound Apr 6, 2022 Retrieval Text to Video Retrieval
Temporal Alignment Networks for Long-term Video Apr 6, 2022 Action Recognition Action Segmentation
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval Mar 28, 2022 Retrieval Text to Video Retrieval
Learning video retrieval models with relevance-aware online mining Mar 16, 2022 Multi-Instance Retrieval Retrieval