FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding Tasks | Mar 24, 2022 | Action Recognition, Retrieval | Code Available
MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization | Mar 14, 2022 | Retrieval, Text to Video Retrieval | Unverified
Live Laparoscopic Video Retrieval with Compressed Uncertainty | Mar 8, 2022 | Retrieval, Video Retrieval | Unverified
VScript: Controllable Script Generation with Visual Presentation | Mar 1, 2022 | Dialogue Generation, Retrieval | Unverified
NEWSKVQA: Knowledge-Aware News Video Question Answering | Feb 8, 2022 | Common Sense Reasoning, Management | Unverified
End-to-end Generative Pretraining for Multimodal Video Captioning | Jan 20, 2022 | Action Classification, Decoder | Unverified
Self-supervised Video Representation Learning with Cascade Positive Retrieval | Jan 20, 2022 | Action Recognition, Contrastive Learning | Code Available
Watch Less and Uncover More: Could Navigation Tools Help Users Search and Explore Videos? | Jan 10, 2022 | Information Retrieval, Retrieval | Unverified
Sign Language Video Retrieval with Free-Form Textual Queries | Jan 7, 2022 | Form, Retrieval | Unverified
Sound and Visual Representation Learning with Multiple Pretraining Tasks | Jan 4, 2022 | Incremental Learning, Representation Learning | Unverified
Vision Transformer Based Video Hashing Retrieval for Tracing the Source of Fake Videos | Dec 15, 2021 | Retrieval, Triplet | Code Available
Self-supervised Spatiotemporal Representation Learning by Exploiting Video Continuity | Dec 11, 2021 | Action Localization, Action Recognition | Unverified
Time-Equivariant Contrastive Video Representation Learning | Dec 7, 2021 | Action Recognition, Contrastive Learning | Unverified
Cross-modal Manifold Cutmix for Self-supervised Video Representation Learning | Dec 7, 2021 | Action Recognition, Representation Learning | Unverified
Generalizable Multi-linear Attention Network | Dec 1, 2021 | Multimodal Sentiment Analysis, Retrieval | Unverified
Induce, Edit, Retrieve: Language Grounded Multimodal Schema for Instructional Video Retrieval | Nov 17, 2021 | Retrieval, Video Retrieval | Unverified
CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation | Nov 16, 2021 | Retrieval, Video Captioning | Unverified
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval | Nov 10, 2021 | Contrastive Learning, Cross-Modal Retrieval | Unverified
CLIP2TV: Align, Match and Distill for Video-Text Retrieval | Nov 10, 2021 | Representation Learning, Retrieval | Unverified
Masking Modalities for Cross-modal Video Retrieval | Nov 1, 2021 | Retrieval, Video Retrieval | Unverified
Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval | Oct 25, 2021 | Domain Adaptation, Retrieval | Unverified
Coarse to Fine: Video Retrieval before Moment Localization | Oct 14, 2021 | Moment Retrieval, Retrieval | Unverified
ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation | Oct 11, 2021 | Moment Retrieval, Retrieval | Unverified
Spatio-Temporal Video Representation Learning for AI Based Video Playback Style Prediction | Oct 3, 2021 | Action Recognition, Representation Learning | Unverified
Vi-MIX for Self-Supervised Video Representation | Sep 29, 2021 | Action Recognition, Representation Learning | Unverified
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding | Sep 28, 2021 | Action Localization, Action Segmentation | Code Available
Self-Supervised Video Representation Learning by Video Incoherence Detection | Sep 26, 2021 | Action Recognition, Contrastive Learning | Unverified
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment | Aug 23, 2021 | Action Segmentation, Contrastive Learning | Unverified
Self-Supervised Video Representation Learning with Meta-Contrastive Network | Aug 19, 2021 | Action Recognition, Contrastive Learning | Unverified
Boosting Video Captioning with Dynamic Loss Network | Jul 25, 2021 | Image Classification | Unverified
Video 3D Sampling for Self-supervised Representation Learning | Jul 8, 2021 | Action Recognition, Representation Learning | Unverified
Use of Affective Visual Information for Summarization of Human-Centric Videos | Jul 8, 2021 | Emotion Recognition, Retrieval | Unverified
Inter-intra Variant Dual Representations for Self-supervised Video Recognition | Jul 2, 2021 | Contrastive Learning, Representation Learning | Code Available
Universal Adversarial Head: Practical Protection against Video Data Leakage | Jun 18, 2021 | Deep Hashing, Retrieval | Unverified
ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency | Jun 4, 2021 | Action Recognition, Representation Learning | Unverified
CNN Retrieval based Unsupervised Metric Learning for Near-Duplicated Video Retrieval | May 30, 2021 | Metric Learning, Re-Ranking | Unverified
SSAN: Separable Self-Attention Network for Video Representation Learning | May 27, 2021 | Action Recognition, Representation Learning | Unverified
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding | May 20, 2021 | Action Segmentation, Language Modeling | Code Available
Action in Mind: A Neural Network Approach to Action Recognition and Segmentation | Apr 30, 2021 | Action Recognition, Action Segmentation | Unverified
T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval | Apr 20, 2021 | Retrieval, Video Retrieval | Code Available
Self-supervised Video Retrieval Transformer Network | Apr 16, 2021 | Retrieval, Self-supervised Video Retrieval | Unverified
Object Priors for Classifying and Localizing Unseen Actions | Apr 10, 2021 | Action Classification, Action Localization | Code Available
Self-supervised Video Representation Learning by Context and Motion Decoupling | Apr 2, 2021 | Action Recognition, CPU | Code Available
CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning | Apr 1, 2021 | Question Answering, Representation Learning | Unverified
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning | Mar 30, 2021 | counterfactual, Object | Unverified
Rudder: A Cross Lingual Video and Text Retrieval Dataset | Mar 9, 2021 | Natural Language Queries, Retrieval | Code Available
Clarification of Video Retrieval Query Results by the Automated Insertion of Supporting Shots | Feb 19, 2021 | Retrieval, Video Editing | Unverified
Win-Fail Action Recognition | Feb 15, 2021 | Action Recognition, Action Understanding | Code Available
Temporal Contrastive Graph Learning for Video Action Recognition and Retrieval | Jan 4, 2021 | Action Recognition, Contrastive Learning | Unverified
Grounding Physical Object and Event Concepts Through Dynamic Visual Reasoning | Jan 1, 2021 | counterfactual, Object | Unverified