FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding Tasks Mar 24, 2022 Action Recognition Retrieval
Code Available 05 Video Logo Retrieval based on local Features Aug 11, 2018 Image Retrieval Retrieval
Code Available 05 Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning Mar 6, 2020 Density Estimation Noise Estimation
Code Available 05 TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval Apr 7, 2025 Contrastive Learning Retrieval
Code Available 05 A Joint Sequence Fusion Model for Video Question Answering and Retrieval Aug 7, 2018 Decoder Multiple-choice
Code Available 05 Circulant temporal encoding for video retrieval and temporal alignment Jun 8, 2015 Retrieval Video Retrieval
Code Available 05 Dialogue-to-Video Retrieval Mar 23, 2023 Recommendation Systems Retrieval
Code Available 05 RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter May 29, 2024 Natural Language Queries parameter-efficient fine-tuning
— Unverified 00 ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling Jun 25, 2024 Cross-Modal Retrieval Natural Language Queries
— Unverified 00 Action in Mind: A Neural Network Approach to Action Recognition and Segmentation Apr 30, 2021 Action Recognition Action Segmentation
— Unverified 00 Advances in Human Action Recognition: A Survey Jan 23, 2015 Action Recognition Retrieval
— Unverified 00 A Faster Method for Tracking and Scoring Videos Corresponding to Sentences Nov 14, 2014 Retrieval Sentence
— Unverified 00 A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus Nov 18, 2020 Language Modeling Language Modelling
— Unverified 00 Analysis of Gait Pattern to Recognize the Human Activities Jul 18, 2014 Activity Recognition Human Activity Recognition
— Unverified 00 An Empirical Study of Frame Selection for Text-to-Video Retrieval Nov 1, 2023 Retrieval Text to Video Retrieval
— Unverified 00 An Improved Video Analysis using Context based Extension of LSH May 10, 2017 Action Recognition Retrieval
— Unverified 00 An Overview of Challenges in Egocentric Text-Video Retrieval Jun 7, 2023 Retrieval Video Retrieval
— Unverified 00 A Proposal-based Approach for Activity Image-to-Video Retrieval Nov 24, 2019 Cross-Modal Retrieval Retrieval
— Unverified 00 A Review of Deep Learning for Video Captioning Apr 22, 2023 Deep Learning Dense Video Captioning
— Unverified 00 ASCNet: Self-supervised Video Representation Learning with Appearance-Speed Consistency Jun 4, 2021 Action Recognition Representation Learning
— Unverified 00 A Survey of Video-based Action Quality Assessment Apr 20, 2022 Action Quality Assessment Action Recognition
— Unverified 00 Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment Jul 24, 2023 Retrieval Text to Video Retrieval
— Unverified 00 Audio-Visual Embedding for Cross-Modal MusicVideo Retrieval through Supervised Deep CCA Aug 10, 2019 audio-visual learning Retrieval
— Unverified 00 A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset Nov 19, 2022 Common Sense Reasoning Graph Embedding
— Unverified 00 A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval Nov 30, 2023 Benchmarking Retrieval
— Unverified 00 Bag of Genres for Video Retrieval May 30, 2015 Retrieval Video Retrieval
— Unverified 00 Binary Subspace Coding for Query-by-Image Video Retrieval Dec 6, 2016 Retrieval Video Retrieval
— Unverified 00 Boosting Video Captioning with Dynamic Loss Network Jul 25, 2021 image-classification Image Classification
— Unverified 00 CHAIN: Exploring Global-Local Spatio-Temporal Information for Improved Self-Supervised Video Hashing Oct 29, 2023 Contrastive Learning Retrieval
— Unverified 00 Clarification of Video Retrieval Query Results by the Automated Insertion of Supporting Shots Feb 19, 2021 Retrieval Video Editing
— Unverified 00 Classroom Video Assessment and Retrieval via Multiple Instance Learning Mar 25, 2014 Multiple Instance Learning Retrieval
— Unverified 00 CLIP2TV: Align, Match and Distill for Video-Text Retrieval Nov 10, 2021 Representation Learning Retrieval
— Unverified 00 CLOP: Video-and-Language Pre-Training with Knowledge Regularizations Nov 7, 2022 Contrastive Learning Retrieval
— Unverified 00 CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture May 3, 2025 Autonomous Driving Benchmarking
— Unverified 00 CNN Retrieval based Unsupervised Metric Learning for Near-Duplicated Video Retrieval May 30, 2021 Metric Learning Re-Ranking
— Unverified 00 Coarse to Fine: Video Retrieval before Moment Localization Oct 14, 2021 Moment Retrieval Retrieval
— Unverified 00 CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing Jan 22, 2024 AudioCaps Audio-Visual Synchronization
— Unverified 00 Colo-SCRL: Self-Supervised Contrastive Representation Learning for Colonoscopic Video Retrieval Mar 28, 2023 Action Recognition Contrastive Learning
— Unverified 00 Contrastive Video-Language Learning with Fine-grained Frame Sampling Oct 10, 2022 Question Answering Representation Learning
— Unverified 00 Controllable Augmentations for Video Representation Learning Mar 30, 2022 Action Recognition Contrastive Learning
— Unverified 00 COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval Apr 15, 2022 Contrastive Learning Cross-Modal Retrieval
— Unverified 00 CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Nov 16, 2021 Retrieval Video Captioning
— Unverified 00 CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Mar 31, 2022 Retrieval Video Captioning
— Unverified 00 CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning Apr 1, 2021 Question Answering Representation Learning
— Unverified 00 Deep Heterogeneous Hashing for Face Video Retrieval Nov 4, 2019 Retrieval Video Retrieval
— Unverified 00 Deep Learning Based Semantic Video Indexing and Retrieval Jan 28, 2016 Deep Learning Retrieval
— Unverified 00 De-Hashing: Server-Side Context-Aware Feature Reconstruction for Mobile Visual Search Jun 29, 2016 Retrieval Video Retrieval
— Unverified 00 Detours for Navigating Instructional Videos Jan 3, 2024 16k Question Answering
— Unverified 00 Discrete Wavelet Transform and Gradient Difference based approach for text localization in videos Feb 24, 2015 Retrieval Text Detection
— Unverified 00 Distilling Vision-Language Models on Millions of Videos Jan 11, 2024 Language Modeling Language Modelling
— Unverified 00