VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding Jun 13, 2024 Dense Video Captioning MVBench
Code Code Available 35 Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models Oct 4, 2024 Dense Video Captioning Sentence
Code Code Available 25 VidChapters-7M: Video Chapters at Scale Sep 25, 2023 Dense Video Captioning Navigate
Code Code Available 25 LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos Nov 29, 2024 Boundary Detection Dense Video Captioning
Code Code Available 25 Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning Feb 27, 2023 Dense Video Captioning Language Modeling
Code Code Available 25 VTimeLLM: Empower LLM to Grasp Video Moments Nov 30, 2023 Dense Video Captioning Temporal Relation Extraction
Code Code Available 25 Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval Apr 11, 2024 Decoder Dense Video Captioning
Code Code Available 25 TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning Apr 14, 2024 Dense Video Captioning Descriptive
Code Code Available 25 VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding May 22, 2024 Dense Video Captioning Highlight Detection
Code Code Available 25 SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries Apr 10, 2023 Dense Video Captioning Video Captioning
Code Code Available 25 OmniVid: A Generative Framework for Universal Video Understanding Mar 26, 2024 Action Recognition Decoder
Code Code Available 25 SODA: Story Oriented Dense Video Captioning Evaluation Framework Aug 1, 2020 Dense Video Captioning Video Captioning
Code Code Available 15 Unifying Event Detection and Captioning as Sequence Generation via Pre-Training Jul 18, 2022 Dense Video Captioning Event Detection
Code Code Available 15 VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format Nov 27, 2024 Dense Video Captioning Grounded Video Question Answering
Code Code Available 15 TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks Nov 23, 2020 Action Classification Action Localization
Code Code Available 15 SoccerNet 2023 Challenges Results Sep 12, 2023 Action Spotting Camera Calibration
Code Code Available 15 COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark Aug 5, 2024 Dense Video Captioning Diversity
Code Code Available 15 Multimodal Pretraining for Dense Video Captioning Nov 10, 2020 Dense Video Captioning Video Captioning
Code Code Available 15 End-to-End Dense Video Captioning with Parallel Decoding Aug 17, 2021 Caption Generation Dense Video Captioning
Code Code Available 15 Multi-modal Dense Video Captioning Mar 17, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos Mar 11, 2023 Dense Video Captioning Natural Language Moment Retrieval
Code Code Available 15 A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer May 17, 2020 Dense Video Captioning Temporal Action Proposal Generation
Code Code Available 15 HiCM^2: Hierarchical Compact Memory Modeling for Dense Video Captioning Dec 19, 2024 Dense Video Captioning Video Captioning
Code Code Available 15 Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020 Jun 21, 2020 Dense Captioning Dense Video Captioning
Code Code Available 15 VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Jan 12, 2025 Dense Video Captioning Video Captioning
Code Code Available 15 Enhancing Traffic Safety with Parallel Dense Video Captioning for End-to-End Event Analysis Apr 12, 2024 Dense Video Captioning Transfer Learning
Code Code Available 15 Towards Automatic Learning of Procedures from Web Instructional Videos Mar 28, 2017 Dense Video Captioning Procedure Learning
Code Code Available 05 Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning Mar 31, 2018 Decoder Dense Video Captioning
Code Code Available 05 Dense Video Captioning Using Unsupervised Semantic Information Dec 15, 2021 Dense Video Captioning Video Captioning
Code Code Available 05 End-to-End Dense Video Captioning with Masked Transformer Apr 3, 2018 Decoder Dense Video Captioning
Code Code Available 05 Global Object Proposals for Improving Multi-Sentence Video Descriptions Jul 18, 2021 Caption Generation Dense Video Captioning
Code Code Available 05 Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning Dec 17, 2024 Dense Video Captioning Descriptive
Code Code Available 05 Joint Event Detection and Description in Continuous Video Streams Feb 28, 2018 Dense Captioning Dense Video Captioning
Code Code Available 05 Live Video Captioning Jun 20, 2024 Dense Video Captioning Live Video Captioning
Code Code Available 05 Event and Entity Extraction from Generated Video Captions Nov 5, 2022 Caption Generation Dense Video Captioning
Code Code Available 05 Sketch, Ground, and Refine: Top-Down Dense Video Captioning Jun 19, 2021 Dense Video Captioning Sentence
Code Code Available 05 SoccerNet 2024 Challenges Results Sep 16, 2024 Action Spotting Dense Video Captioning
Code Code Available 05 Streaming Dense Video Captioning Apr 1, 2024 Dense Video Captioning Live Video Captioning
Code Code Available 05 Streamlined Dense Video Captioning Apr 8, 2019 Dense Video Captioning Reinforcement Learning
Code Code Available 05 Visual Transformation Telling May 3, 2023 Dense Video Captioning Video Captioning
Code Code Available 05 SACT: Self-Aware Multi-Space Feature Composition Transformer for Multinomial Attention for Video Captioning Jun 25, 2020 Dense Video Captioning Video Captioning
— Unverified 00 SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions Jul 24, 2022 Dense Captioning Dense Video Captioning
— Unverified 00 Semantic-Aware Pretraining for Dense Video Captioning Apr 13, 2022 Dense Captioning Dense Video Captioning
— Unverified 00 A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos Sep 30, 2022 Dense Video Captioning Segmentation
— Unverified 00 Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding Nov 25, 2024 Dense Video Captioning Transfer Learning
— Unverified 00 Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization Jun 25, 2025 Dense Video Captioning Descriptive
— Unverified 00 Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding Mar 14, 2025 Denoising Dense Video Captioning
— Unverified 00 Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning Dec 16, 2024 Contrastive Learning Dense Video Captioning
— Unverified 00 Weakly Supervised Dense Video Captioning Apr 5, 2017 Dense Video Captioning Language Modeling
— Unverified 00 Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos Nov 28, 2023 Dense Video Captioning Transfer Learning
— Unverified 00