Generative Semantic Communication: Architectures, Technologies, and Applications Dec 11, 2024 Retrieval Semantic Communication
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach Aug 14, 2024 Cross-Modal Retrieval Language Modeling
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning Mar 30, 2021 counterfactual Object
Grounding Physical Object and Event Concepts Through Dynamic Visual Reasoning Jan 1, 2021 counterfactual Object
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training Dec 30, 2022 cross-modal alignment TGIF-Action
HiVLP: Hierarchical Interactive Video-Language Pre-Training Jan 1, 2023 Retrieval Self-Supervised Learning
HORUS: Multimodal Large Language Models Framework for Video Retrieval at VBS 2025 Jan 1, 2025 Image Retrieval Retrieval
Human Action Recognition and Prediction: A Survey Jun 28, 2018 Action Recognition Autonomous Driving
Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions with Multi-Level Representations Apr 7, 2022 Contrastive Learning Denoising
Improving Video Retrieval by Adaptive Margin Mar 9, 2023 Retrieval Video Retrieval
MuMUR : Multilingual Multimodal Universal Retrieval Aug 24, 2022 Image Retrieval Machine Translation
Induce, Edit, Retrieve: Language Grounded Multimodal Schema for Instructional Video Retrieval Nov 17, 2021 Retrieval Video Retrieval
Interactive Video Retrieval with Dialog May 7, 2019 Retrieval Video Retrieval
Key Frame Extraction with Attention Based Deep Neural Networks Jun 21, 2023 Video Retrieval Video Summarization
KPCA Spatio-temporal trajectory point cloud classifier for recognizing human actions in a CBVR system Mar 26, 2014 Action Recognition Retrieval
Large-Scale Query-by-Image Video Retrieval Using Bloom Filters Jul 12, 2016 Retrieval Video Retrieval
Large Scale Video Representation Learning via Relational Graph Clustering Jun 1, 2020 Clustering Graph Clustering
Vision-Language Models Learn Super Images for Efficient Partially Relevant Video Retrieval Dec 1, 2023 Image Retrieval Partially Relevant Video Retrieval
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision Apr 15, 2023 Language Modeling Language Modelling
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval Jul 11, 2022 Representation Learning Retrieval
Learning and Recognizing Human Action from Skeleton Movement with Deep Residual Neural Networks Mar 21, 2018 Action Recognition Deep Learning
Learning Audio-Video Modalities from Image Captions Apr 1, 2022 Image Captioning Retrieval
Learning Joint Representations of Videos and Sentences with Web Image Search Aug 8, 2016 Image Retrieval Natural Language Queries
Learning Language-Visual Embedding for Movie Understanding with Natural-Language Sep 26, 2016 Multiple-choice Retrieval
Learning Locally-Adaptive Decision Functions for Person Verification Jun 1, 2013 Face Verification Metric Learning
Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval Sep 20, 2023 Retrieval Video Retrieval
Learning text-to-video retrieval from image captioning Apr 26, 2024 Image Captioning Image Retrieval
Learning to Generate Long-term Future Narrations Describing Activities of Daily Living Mar 3, 2025 Action Anticipation Decision Making
Learning Trajectory-Word Alignments for Video-Language Tasks Jan 5, 2023 Question Answering Retrieval
Learning World Models for Interactive Video Generation May 28, 2025 In-Context Learning Retrieval
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review May 29, 2025 Retrieval Text to Video Retrieval
Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning Dec 10, 2023 Language Modeling Language Modelling
Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval Apr 2, 2025 cross-modal alignment Retrieval
LiteVL: Efficient Video-Language Learning with Enhanced Spatial-Temporal Modeling Oct 21, 2022 Language Modeling Language Modelling
Live Laparoscopic Video Retrieval with Compressed Uncertainty Mar 8, 2022 Retrieval Video Retrieval
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Mar 4, 2025 Contrastive Learning Image-text Retrieval
Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory Mar 17, 2025 Form GPU
Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval Nov 3, 2023 Recommendation Systems Retrieval
MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed Jun 11, 2025 Retrieval Video Retrieval
MarineVRS: Marine Video Retrieval System with Explainability via Semantic Understanding Jun 7, 2023 Retrieval Sentence
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval Dec 2, 2022 Image-text Retrieval Retrieval
Masking Modalities for Cross-modal Video Retrieval Nov 1, 2021 Retrieval Video Retrieval
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval May 13, 2023 Retrieval Text Retrieval
MDMMT-2: Multidomain Multimodal Transformer for Video Retrieval, One More Step Towards Generalization Mar 14, 2022 Retrieval Text to Video Retrieval
MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline Jul 17, 2024 Question Answering Retrieval
Modality-Balanced Embedding for Video Retrieval Apr 18, 2022 Retrieval Text Matching
Motion Sensitive Contrastive Learning for Self-supervised Video Representation Aug 12, 2022 Contrastive Learning Representation Learning
MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling Mar 10, 2023 Multi-Label Classification MUlTI-LABEL-ClASSIFICATION
Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video Retrieval Jun 21, 2024 Retrieval Sentence
Multi-Granularity Graph Pooling for Video-based Person Re-Identification Sep 23, 2022 Node Clustering Person Re-Identification