PV-VTT: A Privacy-Centric Dataset for Mission-Specific Anomaly Detection and Natural Language Interpretation Oct 30, 2024 Anomaly Detection Descriptive
— Unverified 00 Relational Graph Learning for Grounded Video Description Generation Dec 2, 2021 Graph Learning Hallucination
— Unverified 00 Saarland: Vector-based models of semantic textual similarity Jul 1, 2012 Semantic Textual Similarity Video Description
— Unverified 00 Semantic Neighborhoods as Hypergraphs Aug 1, 2013 Machine Translation Paraphrase Generation
— Unverified 00 SHEF-Multimodal: Grounding Machine Translation on Images Aug 1, 2016 Machine Translation Multimodal Machine Translation
— Unverified 00 SRIUBC: Simple Similarity Features for Semantic Textual Similarity Jul 1, 2012 Natural Language Inference Paraphrase Identification
— Unverified 00 Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation Dec 28, 2021 Image Captioning Machine Translation
— Unverified 00 Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description Jul 1, 2017 Video Captioning Video Description
— Unverified 00 Technical Report: Competition Solution For Modelscope-Sora Sep 24, 2024 Text-to-Video Generation Video Description
— Unverified 00 The Role of the Input in Natural Language Video Description Feb 9, 2021 Data Augmentation Video Description
— Unverified 00 Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time Jan 14, 2025 Object Recognition Text Generation
— Unverified 00 Unbox the Blackbox: Predict and Interpret YouTube Viewership Using Deep Learning Dec 21, 2020 Misinformation Prediction
— Unverified 00 Vectors of Locally Aggregated Centers for Compact Video Representation Sep 13, 2015 Clustering Video Description
— Unverified 00 VideoA11y: Method and Dataset for Accessible Video Description Feb 27, 2025 Video Description
— Unverified 00 VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models Oct 1, 2024 Hallucination text similarity
— Unverified 00 Video Description: A Survey of Methods, Datasets and Evaluation Metrics Jun 1, 2018 Diversity Language Modeling
— Unverified 00 VideoMCC: a New Benchmark for Video Comprehension Jun 23, 2016 Multiple-choice Video Description
— Unverified 00 Visual-aware Attention Dual-stream Decoder for Video Captioning Oct 16, 2021 Decoder Video Captioning
— Unverified 00 A Comprehensive Review on Recent Methods and Challenges of Video Description Nov 30, 2020 Machine Translation Survey
— Unverified 00 X-VARS: Introducing Explainability in Football Refereeing with Multi-Modal Large Language Model Apr 7, 2024 Action Recognition Decision Making
— Unverified 00 ActionHub: A Large-scale Action Video Description Dataset for Zero-shot Action Recognition Jan 22, 2024 Action Recognition Video Description
— Unverified 00 Active Learning for Video Description With Cluster-Regularized Ensemble Ranking Jul 27, 2020 Active Learning Video Captioning
— Unverified 00 A Dataset for Telling the Stories of Social Media Videos Oct 1, 2018 Sentence Video Captioning
— Unverified 00 A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and Other Sources about the 2024 Outbreak of Measles Jun 11, 2024 Sentiment Analysis Subjectivity Analysis
— Unverified 00 A Multi-scale Multiple Instance Video Description Network May 21, 2015 Image Segmentation Multiple Instance Learning
— Unverified 00 Analyzing Political Figures in Real-Time: Leveraging YouTube Metadata for Sentiment Analysis Sep 28, 2023 Sentiment Analysis Video Description
— Unverified 00 An Efficient Keyframes Selection Based Framework for Video Captioning Dec 1, 2021 Text Generation Video Captioning
— Unverified 00 End-to-End Video Captioning Apr 4, 2019 Action Recognition Caption Generation
— Unverified 00 A Thousand Frames in Just a Few Words: Lingual Description of Videos through Latent Topics and Sparse Object Stitching Jun 1, 2013 Image Description Video Description
— Unverified 00 Attend and Interact: Higher-Order Object Interactions for Video Understanding Nov 16, 2017 Action Classification Action Recognition
— Unverified 00 Attention Based Encoder Decoder Model for Video Captioning in Nepali (2023) Dec 12, 2023 Decoder Video Captioning
— Unverified 00 Attention-Based Multimodal Fusion for Video Description Jan 11, 2017 Decoder Sentence
— Unverified 00 Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions Aug 27, 2018 Translation Video Description
— Unverified 00 AVD2: Accident Video Diffusion for Accident Video Description Feb 20, 2025 Autonomous Driving Scene Understanding
— Unverified 00 Better Exploiting Motion for Better Action Recognition Jun 1, 2013 Action Recognition Image Retrieval
— Unverified 00 Bidirectional Long-Short Term Memory for Video Description Jun 15, 2016 Language Modeling Language Modelling
— Unverified 00 Boosting Video Captioning with Dynamic Loss Network Jul 25, 2021 image-classification Image Classification
— Unverified 00 Bridge Video and Text with Cascade Syntactic Structure Aug 1, 2018 Attribute Object
— Unverified 00 FIOVA: A Multi-Annotator Benchmark for Human-Aligned Video Captioning Oct 20, 2024 Diagnostic Video Captioning
— Unverified 00 Prediction and Description of Near-Future Activities in Video Aug 2, 2019 Prediction Video Captioning
— Unverified 00 CLearViD: Curriculum Learning for Video Description Nov 8, 2023 Diversity Video Description
— Unverified 00 Coherent Multi-Sentence Video Description with Variable Level of Detail Mar 24, 2014 Sentence Video Description
— Unverified 00 Cross-Modal Learning for Music-to-Music-Video Description Generation Mar 14, 2025 Video Description Video Generation
— Unverified 00 DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description Mar 31, 2025 Video Description Video Understanding
— Unverified 00 Efficient data-driven encoding of scene motion using Eccentricity Mar 3, 2021 Activity Recognition Intent Recognition
— Unverified 00 Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis Feb 11, 2025 Action Recognition Video Description
— Unverified 00 Generating Video Description using Sequence-to-sequence Model with Temporal Attention Dec 1, 2016 Caption Generation Sentence
— Unverified 00 HENRY-CORE: Domain Adaptation and Stacking for Text Similarity Jun 1, 2013 Domain Adaptation Machine Translation
— Unverified 00 Hierarchical Boundary-Aware Neural Encoder for Video Captioning Nov 28, 2016 Decoder Video Captioning
— Unverified 00 HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation Mar 31, 2025 Hallucination Human-Object Interaction Detection
— Unverified 00