DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement Apr 3, 2024 Dense Video Captioning Diversity
— Unverified 00 PolySmart @ TRECVid 2024 Video Captioning (VTT) Dec 20, 2024 Video Captioning
— Unverified 00 Describe Anything: Detailed Localized Image and Video Captioning Apr 22, 2025 Sentence Video Captioning
— Unverified 00 End-to-End Video Captioning Apr 4, 2019 Action Recognition Caption Generation
— Unverified 00 Pre-training for Video Captioning Challenge 2020 Summary Jul 27, 2020 Video Captioning
— Unverified 00 Procedural Text Generation from an Execution Video Nov 1, 2017 Object Recognition Text Generation
— Unverified 00 Progress-Aware Video Frame Captioning Dec 3, 2024 Image Captioning Video Captioning
— Unverified 00 Dense Video Captioning using Graph-based Sentence Summarization Jun 25, 2025 Dense Video Captioning Sentence
— Unverified 00 Zero-Shot Dense Video Captioning by Jointly Optimizing Text and Moment Jul 5, 2023 Dense Video Captioning Language Modelling
— Unverified 00 Recent Advances in Video Question Answering: A Review of Datasets and Methods Jan 15, 2021 Information Retrieval Machine Translation
— Unverified 00 Recipe Generation from Unsegmented Cooking Videos Sep 21, 2022 Dense Video Captioning Recipe Generation
— Unverified 00 Reconstruct and Represent Video Contents for Captioning via Reinforcement Learning Jun 3, 2019 Decoder reinforcement-learning
— Unverified 00 VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending May 22, 2023 Question Answering Retrieval
— Unverified 00 Recurrent Memory Addressing for describing videos Nov 20, 2016 Video Captioning
— Unverified 00 Reexamining Racial Disparities in Automatic Speech Recognition Performance: The Role of Confounding by Provenance Jul 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 An Efficient Keyframes Selection Based Framework for Video Captioning Dec 1, 2021 Text Generation Video Captioning
— Unverified 00 ReGen: A good Generative Zero-Shot Video Classifier Should be Rewarded Jan 1, 2023 Action Classification Action Recognition
— Unverified 00 Reinforced Video Captioning with Entailment Rewards Aug 7, 2017 reinforcement-learning Reinforcement Learning
— Unverified 00 Relational Reasoning using Prior Knowledge for Visual Captioning Jun 4, 2019 Image Captioning object-detection
— Unverified 00 Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols Nov 5, 2023 Caption Generation Dense Video Captioning
— Unverified 00 Retrieval-Augmented Egocentric Video Captioning Jan 1, 2024 Representation Learning Retrieval
— Unverified 00 RETTA: Retrieval-Enhanced Test-Time Adaptation for Zero-Shot Video Captioning May 11, 2024 Image-text matching Retrieval
— Unverified 00 Deep Reinforcement Learning for NLP Jul 1, 2018 Atari Games coreference-resolution
— Unverified 00 RUC+CMU: System Report for Dense Captioning Events in Videos Jun 22, 2018 Caption Generation Dense Captioning
— Unverified 00 SACT: Self-Aware Multi-Space Feature Composition Transformer for Multinomial Attention for Video Captioning Jun 25, 2020 Dense Video Captioning Video Captioning
— Unverified 00 SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions Jul 24, 2022 Dense Captioning Dense Video Captioning
— Unverified 00 SBAT: Video Captioning with Sparse Boundary-Aware Transformer Jul 23, 2020 Machine Translation multimodal interaction
— Unverified 00 Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data Apr 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding May 20, 2021 Action Segmentation Language Modeling
— Unverified 00 An Attempt towards Interpretable Audio-Visual Video Captioning Dec 7, 2018 Audio captioning Audio-Visual Video Captioning
— Unverified 00 Semantic-Aware Pretraining for Dense Video Captioning Apr 13, 2022 Dense Captioning Dense Video Captioning
— Unverified 00 CUPID: Adaptive Curation of Pre-training Data for Video-and-Language Representation Learning Apr 1, 2021 Question Answering Representation Learning
— Unverified 00 Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks Oct 7, 2023 Action Recognition Multiple-choice
— Unverified 00 Semi-Supervised Learning for Video Captioning Nov 1, 2020 Video Captioning
— Unverified 00 SEM-POS: Grammatically and Semantically Correct Video Captioning Mar 26, 2023 POS Video Captioning
— Unverified 00 Amortized Context Vector Inference for Sequence-to-Sequence Networks May 23, 2018 Document Summarization Variational Inference
— Unverified 00 Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding Nov 25, 2024 Dense Video Captioning Transfer Learning
— Unverified 00 Set Prediction Guided by Semantic Concepts for Diverse Video Captioning Dec 25, 2023 Caption Generation Diversity
— Unverified 00 Crowd Video Captioning Nov 13, 2019 Video Captioning
— Unverified 00 CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations Sep 30, 2021 Contrastive Learning Retrieval
— Unverified 00 CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Mar 31, 2022 Retrieval Video Captioning
— Unverified 00 Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization Jun 25, 2025 Dense Video Captioning Descriptive
— Unverified 00 Aligning Source Visual and Target Language Domains for Unpaired Video Captioning Nov 22, 2022 Translation Video Captioning
— Unverified 00 CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation Nov 16, 2021 Retrieval Video Captioning
— Unverified 00 SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability Oct 7, 2019 Text Generation Video Captioning
— Unverified 00 SnapCap: Efficient Snapshot Compressive Video Captioning Jan 10, 2024 Compressive Sensing Video Captioning
— Unverified 00 Learning Video Representations using Contrastive Bidirectional Transformer Jun 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Agent-based Video Trimming Dec 12, 2024 Highlight Detection Moment Retrieval
— Unverified 00 Consensus-based Sequence Training for Video Captioning Dec 27, 2017 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 00 Collaborative Three-Stream Transformers for Video Captioning Sep 18, 2023 Sentence Video Captioning
— Unverified 00