Revitalize Region Feature for Democratizing Video-Language Pre-training of Retrieval Mar 15, 2022 Question Answering Retrieval
Code Code Available 1Disentangled Representation Learning for Text-Video Retrieval Mar 14, 2022 Representation Learning Retrieval
Code Code Available 1Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data Mar 14, 2022 Articles Retrieval
Code Code Available 1Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval Feb 7, 2022 Contrastive Learning Quantization
Code Code Available 1Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval Jan 23, 2022 Representation Learning Retrieval
Code Code Available 1Bridging Video-text Retrieval with Multiple Choice Questions Jan 13, 2022 Action Recognition Linear evaluation
Code Code Available 1Multi-Query Video Retrieval Jan 10, 2022 Retrieval Video Retrieval
Code Code Available 1Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval Jan 1, 2022 Action Localization Retrieval
Code Code Available 1Video Joint Modelling Based on Hierarchical Transformer for Co-summarization Dec 27, 2021 Retrieval Supervised Video Summarization
Code Code Available 1Cross Modal Retrieval with Querybank Normalisation Dec 23, 2021 Cross-Modal Retrieval Metric Learning
Code Code Available 1Align and Prompt: Video-and-Language Pre-training with Entity Prompts Dec 17, 2021 cross-modal alignment Entity Alignment
Code Code Available 1Prompting Visual-Language Models for Efficient Video Understanding Dec 8, 2021 Action Recognition Language Modelling
Code Code Available 1Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval Dec 8, 2021 Action Localization Retrieval
Code Code Available 1TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning Dec 7, 2021 Action Recognition Contrastive Learning
Code Code Available 1Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval Dec 3, 2021 Ad-hoc video search feature selection
Code Code Available 1AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant Nov 30, 2021 Question Answering Retrieval
Code Code Available 1Video Content Classification using Deep Learning Nov 27, 2021 Classification Deep Learning
Code Code Available 1VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling Nov 24, 2021 Question Answering Retrieval
Code Code Available 1Florence: A New Foundation Model for Computer Vision Nov 22, 2021 Action Classification Action Recognition
Code Code Available 1Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions Nov 19, 2021 Retrieval Super-Resolution
Code Code Available 1BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval Oct 29, 2021 Cross-Modal Retrieval Relation
Code Code Available 1Video and Text Matching with Conditioned Embeddings Oct 21, 2021 Machine Translation Sentence
Code Code Available 1CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval Sep 21, 2021 Corpus Video Moment Retrieval Moment Retrieval
Code Code Available 1Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss Sep 9, 2021 Mixture-of-Experts Retrieval
Code Code Available 1Video Contrastive Learning with Global Context Aug 5, 2021 Action Classification Action Localization
Code Code Available 1DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval Jun 24, 2021 Computational Efficiency Knowledge Distillation
Code Code Available 1CLIP2Video: Mastering Video-Text Retrieval via Image CLIP Jun 21, 2021 Language Modeling Language Modelling
Code Code Available 1Self-Supervised Video Hashing via Bidirectional Transformers Jun 19, 2021 Decoder Retrieval
Code Code Available 1Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting Jun 18, 2021 Action Recognition Action Recognition In Videos
Code Code Available 1VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation Jun 8, 2021 Multi-Task Learning Question Answering
Code Code Available 1DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization Jun 1, 2021 Question Answering Retrieval
Code Code Available 1TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains Apr 27, 2021 Ad-hoc video search Instance Search
Code Code Available 1Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos Apr 26, 2021 Action Localization Clustering
Code Code Available 1VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text Apr 22, 2021 Action Classification Action Recognition
Code Code Available 1CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval Apr 18, 2021 Retrieval Text Retrieval
Code Code Available 1TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval Apr 16, 2021 Retrieval Video Retrieval
Code Code Available 1Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval Apr 1, 2021 Retrieval Text Retrieval
Code Code Available 1MDMMT: Multidomain Multimodal Transformer for Video Retrieval Mar 19, 2021 Retrieval Text to Video Retrieval
Code Code Available 1On Semantic Similarity in Video Retrieval Mar 18, 2021 Retrieval Semantic Similarity
Code Code Available 1A Straightforward Framework For Video Retrieval Using CLIP Feb 24, 2021 Retrieval Video Retrieval
Code Code Available 1SeqNet: Learning Descriptors for Sequence-based Hierarchical Place Recognition Feb 23, 2021 Autonomous Driving Image Retrieval
Code Code Available 1Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling Feb 11, 2021 Question Answering Retrieval
Code Code Available 1COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning Nov 1, 2020 Cross-Modal Retrieval Representation Learning
Code Code Available 1Pretext-Contrastive Learning: Toward Good Practices in Self-supervised Video Representation Leaning Oct 29, 2020 Contrastive Learning Data Augmentation
Code Code Available 1RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning Oct 27, 2020 Action Recognition Representation Learning
Code Code Available 1Self-supervised Co-training for Video Representation Learning Oct 19, 2020 Action Recognition Contrastive Learning
Code Code Available 1Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning Oct 17, 2020 Retrieval Transfer Learning
Code Code Available 1Dual Encoding for Video Retrieval by Text Sep 10, 2020 Ad-hoc video search Retrieval
Code Code Available 1Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics Aug 31, 2020 Action Recognition Representation Learning
Code Code Available 1Self-supervised Video Representation Learning by Pace Prediction Aug 13, 2020 Action Recognition Contrastive Learning
Code Code Available 1