Learning and Recognizing Human Action from Skeleton Movement with Deep Residual Neural Networks Mar 21, 2018 Action Recognition Deep Learning
Learning Audio-Video Modalities from Image Captions Apr 1, 2022 Image Captioning Retrieval
Learning Joint Representations of Videos and Sentences with Web Image Search Aug 8, 2016 Image Retrieval Natural Language Queries
Learning Language-Visual Embedding for Movie Understanding with Natural-Language Sep 26, 2016 Multiple-choice Retrieval
Learning Locally-Adaptive Decision Functions for Person Verification Jun 1, 2013 Face Verification Metric Learning
Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval Sep 20, 2023 Retrieval Video Retrieval
Learning text-to-video retrieval from image captioning Apr 26, 2024 Image Captioning Image Retrieval
Learning to Generate Long-term Future Narrations Describing Activities of Daily Living Mar 3, 2025 Action Anticipation Decision Making
Learning Trajectory-Word Alignments for Video-Language Tasks Jan 5, 2023 Question Answering Retrieval
Learning World Models for Interactive Video Generation May 28, 2025 In-Context Learning Retrieval
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review May 29, 2025 Retrieval Text to Video Retrieval
Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning Dec 10, 2023 Language Modeling
Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval Apr 2, 2025 cross-modal alignment Retrieval
LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling Oct 21, 2022 Language Modeling
Live Laparoscopic Video Retrieval with Compressed Uncertainty Mar 8, 2022 Retrieval Video Retrieval
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Mar 4, 2025 Contrastive Learning Image-text Retrieval
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory Mar 17, 2025 Form GPU
Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval Nov 3, 2023 Recommendation Systems Retrieval
MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed Jun 11, 2025 Retrieval Video Retrieval
MarineVRS: Marine Video Retrieval System with Explainability via Semantic Understanding Jun 7, 2023 Retrieval Sentence
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval Dec 2, 2022 Image-text Retrieval Retrieval
Masking Modalities for Cross-modal Video Retrieval Nov 1, 2021 Retrieval Video Retrieval
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval May 13, 2023 Retrieval Text Retrieval
MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization Mar 14, 2022 Retrieval Text to Video Retrieval
MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline Jul 17, 2024 Question Answering Retrieval
Modality-Balanced Embedding for Video Retrieval Apr 18, 2022 Retrieval Text Matching
Motion Sensitive Contrastive Learning for Self-supervised Video Representation Aug 12, 2022 Contrastive Learning Representation Learning
MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling Mar 10, 2023 Multi-Label Classification
Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video Retrieval Jun 21, 2024 Retrieval Sentence
Multi-Granularity Graph Pooling for Video-based Person Re-Identification Sep 23, 2022 Node Clustering Person Re-Identification
Multimodal Approach for Video Surveillance Indexing and Retrieval Aug 6, 2013 Retrieval Video Retrieval
Multimodal Contextualized Support for Enhancing Video Retrieval System Dec 10, 2024 Object Detection
Multimodal Skip-gram Using Convolutional Pseudowords Nov 12, 2015 Object Recognition Retrieval
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence Apr 16, 2020 Retrieval Sentence
MultiVENT 2.0: A Massive Multilingual Benchmark for Event-Centric Video Retrieval Oct 15, 2024 Descriptive Retrieval
MultiVENT: Multilingual Videos of Events with Aligned Natural Text Jul 6, 2023 Information Retrieval Retrieval
Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions Mar 7, 2025 Retrieval Video Retrieval
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality Aug 18, 2024 Retrieval Text Retrieval
Near-duplicate video detection featuring coupled temporal and perceptual visual structures and logical inference based matching May 15, 2020 Retrieval Video Editing
Neighborhood Preserving Hashing for Scalable Video Retrieval Oct 1, 2019 Retrieval Video Retrieval
Neural Graph Matching for Video Retrieval in Large-Scale Video-driven E-commerce Aug 1, 2024 Graph Matching Retrieval
NEWSKVQA: Knowledge-Aware News Video Question Answering Feb 8, 2022 Common Sense Reasoning Management
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision Dec 20, 2023 Action Classification Attribute
Not All Pairs are Equal: Hierarchical Learning for Average-Precision-Oriented Video Retrieval Jul 22, 2024 All Retrieval
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks Sep 15, 2022 Action Classification Action Recognition
Perfect Match in Video Retrieval Mar 29, 2023 Retrieval Video Retrieval
PIDRo: Parallel Isomeric Attention with Dynamic Routing for Text-Video Retrieval Jan 1, 2023 Representation Learning Retrieval
PolySmart @ TRECVid 2024 Medical Video Question Answering Dec 20, 2024 Question Answering Retrieval
Pose-Aided Video-based Person Re-Identification via Recurrent Graph Convolutional Network Sep 23, 2022 Person Re-Identification Retrieval
Probabilistic Representations for Video Contrastive Learning Apr 8, 2022 Action Recognition Contrastive Learning