TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning | Dec 7, 2021 | Action Recognition, Contrastive Learning
Code Available | 1 | Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval | Dec 3, 2021 | Ad-hoc video search, feature selection
Code Available | 1 | Generalizable Multi-linear Attention Network | Dec 1, 2021 | Multimodal Sentiment Analysis, Retrieval
Unverified | 0 | AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant | Nov 30, 2021 | Question Answering, Retrieval
Code Available | 1 | Video Content Classification using Deep Learning | Nov 27, 2021 | Classification, Deep Learning
Code Available | 1 | VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling | Nov 24, 2021 | Question Answering, Retrieval
Code Available | 1 | Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action Classification, Action Recognition
Code Available | 1 | Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions | Nov 19, 2021 | Retrieval, Super-Resolution
Code Available | 1 | Induce, Edit, Retrieve: Language Grounded Multimodal Schema for Instructional Video Retrieval | Nov 17, 2021 | Retrieval, Video Retrieval
Unverified | 0 | CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation | Nov 16, 2021 | Retrieval, Video Captioning
Unverified | 0 | CLIP2TV: Align, Match and Distill for Video-Text Retrieval | Nov 10, 2021 | Representation Learning, Retrieval
Unverified | 0 | SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval | Nov 10, 2021 | Contrastive Learning, Cross-Modal Retrieval
Unverified | 0 | Masking Modalities for Cross-modal Video Retrieval | Nov 1, 2021 | Retrieval, Video Retrieval
Unverified | 0 | BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval | Oct 29, 2021 | Cross-Modal Retrieval, Relation
Code Available | 1 | Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval | Oct 25, 2021 | Domain Adaptation, Retrieval
Unverified | 0 | Video and Text Matching with Conditioned Embeddings | Oct 21, 2021 | Machine Translation, Sentence
Code Available | 1 | Coarse to Fine: Video Retrieval before Moment Localization | Oct 14, 2021 | Moment Retrieval, Retrieval
Unverified | 0 | ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation | Oct 11, 2021 | Moment Retrieval, Retrieval
Unverified | 0 | Spatio-Temporal Video Representation Learning for AI Based Video Playback Style Prediction | Oct 3, 2021 | Action Recognition, Representation Learning
Unverified | 0 | Vi-MIX for Self-Supervised Video Representation | Sep 29, 2021 | Action Recognition, Representation Learning
Unverified | 0 | VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding | Sep 28, 2021 | Action Localization, Action Segmentation
Unverified | 0 | Self-Supervised Video Representation Learning by Video Incoherence Detection | Sep 26, 2021 | Action Recognition, Contrastive Learning
Unverified | 0 | CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval | Sep 21, 2021 | Corpus Video Moment Retrieval, Moment Retrieval
Code Available | 1 | Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Sep 9, 2021 | Mixture-of-Experts, Retrieval
Code Available | 1 | TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment | Aug 23, 2021 | Action Segmentation, Contrastive Learning
Unverified | 0 | Self-Supervised Video Representation Learning with Meta-Contrastive Network | Aug 19, 2021 | Action Recognition, Contrastive Learning
Unverified | 0 | Video Contrastive Learning with Global Context | Aug 5, 2021 | Action Classification, Action Localization
Code Available | 1 | Boosting Video Captioning with Dynamic Loss Network | Jul 25, 2021 | image-classification, Image Classification
Unverified | 0 | Use of Affective Visual Information for Summarization of Human-Centric Videos | Jul 8, 2021 | Emotion Recognition, Retrieval
Unverified | 0 | Video 3D Sampling for Self-supervised Representation Learning | Jul 8, 2021 | Action Recognition, Representation Learning
Unverified | 0 | Inter-intra Variant Dual Representations for Self-supervised Video Recognition | Jul 2, 2021 | Contrastive Learning, Representation Learning
Code Available | 0 | DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval | Jun 24, 2021 | Computational Efficiency, Knowledge Distillation
Code Available | 1 | CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | Jun 21, 2021 | Language Modeling, Language Modelling
Code Available | 1 | Self-Supervised Video Hashing via Bidirectional Transformers | Jun 19, 2021 | Decoder, Retrieval
Code Available | 1 | Universal Adversarial Head: Practical Protection against Video Data Leakage | Jun 18, 2021 | Deep Hashing, Retrieval
Unverified | 0 | Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting | Jun 18, 2021 | Action Recognition, Action Recognition In Videos
Code Available | 1 | VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation | Jun 8, 2021 | Multi-Task Learning, Question Answering
Code Available | 1 | ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency | Jun 4, 2021 | Action Recognition, Representation Learning
Unverified | 0 | DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization | Jun 1, 2021 | Question Answering, Retrieval
Code Available | 1 | CNN Retrieval based Unsupervised Metric Learning for Near-Duplicated Video Retrieval | May 30, 2021 | Metric Learning, Re-Ranking
Unverified | 0 | SSAN: Separable Self-Attention Network for Video Representation Learning | May 27, 2021 | Action Recognition, Representation Learning
Unverified | 0 | VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding | May 20, 2021 | Action Segmentation, Language Modeling
Unverified | 0 | Action in Mind: A Neural Network Approach to Action Recognition and Segmentation | Apr 30, 2021 | Action Recognition, Action Segmentation
Unverified | 0 | TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains | Apr 27, 2021 | Ad-hoc video search, Instance Search
Code Available | 1 | Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos | Apr 26, 2021 | Action Localization, Clustering
Code Available | 1 | VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text | Apr 22, 2021 | Action Classification, Action Recognition
Code Available | 1 | T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval | Apr 20, 2021 | Retrieval, Video Retrieval
Code Available | 0 | CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval | Apr 18, 2021 | Retrieval, Text Retrieval
Code Available | 1 | Self-supervised Video Retrieval Transformer Network | Apr 16, 2021 | Retrieval, Self-supervised Video Retrieval
Unverified | 0 | TEACHTEXT: CrossModal Generalized Distillation for Text-Video Retrieval | Apr 16, 2021 | Retrieval, Video Retrieval
Code Available | 1