VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding Jun 13, 2024 Dense Video Captioning MVBench
Code Code Available 3LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos Nov 29, 2024 Boundary Detection Dense Video Captioning
Code Code Available 2Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models Oct 4, 2024 Dense Video Captioning Sentence
Code Code Available 2VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding May 22, 2024 Dense Video Captioning Highlight Detection
Code Code Available 2TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning Apr 14, 2024 Dense Video Captioning Descriptive
Code Code Available 2Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval Apr 11, 2024 Decoder Dense Video Captioning
Code Code Available 2OmniVid: A Generative Framework for Universal Video Understanding Mar 26, 2024 Action Recognition Decoder
Code Code Available 2VTimeLLM: Empower LLM to Grasp Video Moments Nov 30, 2023 Dense Video Captioning Temporal Relation Extraction
Code Code Available 2VidChapters-7M: Video Chapters at Scale Sep 25, 2023 Dense Video Captioning Navigate
Code Code Available 2SoccerNet-Caption: Dense Video Captioning for Soccer Broadcasts Commentaries Apr 10, 2023 Dense Video Captioning Video Captioning
Code Code Available 2Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning Feb 27, 2023 Dense Video Captioning Language Modeling
Code Code Available 2VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Jan 12, 2025 Dense Video Captioning Video Captioning
Code Code Available 1HiCM^2: Hierarchical Compact Memory Modeling for Dense Video Captioning Dec 19, 2024 Dense Video Captioning Video Captioning
Code Code Available 1VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format Nov 27, 2024 Dense Video Captioning Grounded Video Question Answering
Code Code Available 1COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark Aug 5, 2024 Dense Video Captioning Diversity
Code Code Available 1Enhancing Traffic Safety with Parallel Dense Video Captioning for End-to-End Event Analysis Apr 12, 2024 Dense Video Captioning Transfer Learning
Code Code Available 1SoccerNet 2023 Challenges Results Sep 12, 2023 Action Spotting Camera Calibration
Code Code Available 1Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos Mar 11, 2023 Dense Video Captioning Natural Language Moment Retrieval
Code Code Available 1Unifying Event Detection and Captioning as Sequence Generation via Pre-Training Jul 18, 2022 Dense Video Captioning Event Detection
Code Code Available 1End-to-End Dense Video Captioning with Parallel Decoding Aug 17, 2021 Caption Generation Dense Video Captioning
Code Code Available 1TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks Nov 23, 2020 Action Classification Action Localization
Code Code Available 1Multimodal Pretraining for Dense Video Captioning Nov 10, 2020 Dense Video Captioning Video Captioning
Code Code Available 1SODA: Story Oriented Dense Video Captioning Evaluation Framework Aug 1, 2020 Dense Video Captioning Video Captioning
Code Code Available 1Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020 Jun 21, 2020 Dense Captioning Dense Video Captioning
Code Code Available 1A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer May 17, 2020 Dense Video Captioning Temporal Action Proposal Generation
Code Code Available 1Multi-modal Dense Video Captioning Mar 17, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Dense Video Captioning using Graph-based Sentence Summarization Jun 25, 2025 Dense Video Captioning Sentence
— Unverified 0Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization Jun 25, 2025 Dense Video Captioning Descriptive
— Unverified 0TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation Apr 24, 2025 Caption Generation Dense Video Captioning
— Unverified 0Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding Mar 14, 2025 Denoising Dense Video Captioning
— Unverified 0Event-Equalized Dense Video Captioning Jan 1, 2025 Dense Video Captioning Video Captioning
— Unverified 0Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning Dec 17, 2024 Dense Video Captioning Descriptive
Code Code Available 0Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning Dec 16, 2024 Contrastive Learning Dense Video Captioning
— Unverified 0Video LLMs for Temporal Reasoning in Long Videos Dec 4, 2024 Action Segmentation Dense Video Captioning
— Unverified 0Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding Nov 25, 2024 Dense Video Captioning Transfer Learning
— Unverified 0Whats in a Video: Factorized Autoregressive Decoding for Online Dense Video Captioning Nov 22, 2024 Dense Video Captioning Video Captioning
— Unverified 0Technical Report for Soccernet 2023 -- Dense Video Captioning Oct 31, 2024 Dense Video Captioning Video Captioning
— Unverified 0SoccerNet 2024 Challenges Results Sep 16, 2024 Action Spotting Dense Video Captioning
Code Code Available 0Live Video Captioning Jun 20, 2024 Dense Video Captioning Live Video Captioning
Code Code Available 0The 8th AI City Challenge Apr 15, 2024 Dense Video Captioning Video Captioning
— Unverified 0DIBS: Enhancing Dense Video Captioning with Unlabeled Videos via Pseudo Boundary Enrichment and Online Refinement Apr 3, 2024 Dense Video Captioning Diversity
— Unverified 0Streaming Dense Video Captioning Apr 1, 2024 Dense Video Captioning Live Video Captioning
Code Code Available 0Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding Mar 24, 2024 Dense Video Captioning Temporal Localization
— Unverified 0Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos Nov 28, 2023 Dense Video Captioning Transfer Learning
— Unverified 0Dense Video Captioning: A Survey of Techniques, Datasets and Evaluation Protocols Nov 5, 2023 Caption Generation Dense Video Captioning
— Unverified 0Towards Surveillance Video-and-Language Understanding: New Dataset, Baselines, and Challenges Sep 25, 2023 Anomaly Detection Dense Video Captioning
— Unverified 0Zero-Shot Dense Video Captioning by Jointly Optimizing Text and Moment Jul 5, 2023 Dense Video Captioning Language Modelling
— Unverified 0Visual Transformation Telling May 3, 2023 Dense Video Captioning Video Captioning
Code Code Available 0A Review of Deep Learning for Video Captioning Apr 22, 2023 Deep Learning Dense Video Captioning
— Unverified 0Exploiting Auxiliary Caption for Video Grounding Jan 15, 2023 Contrastive Learning Dense Video Captioning
— Unverified 0