| Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection | Aug 6, 2024 | audio moment retrieval, Highlight Detection | Code Available | 3 | 5 |
| VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding | May 22, 2024 | Dense Video Captioning, Highlight Detection | Code Available | 2 | 5 |
| Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding | Nov 15, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| UniVTG: Towards Unified Video-Language Temporal Grounding | Jul 31, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection | Mar 23, 2022 | Decoder, Highlight Detection | Code Available | 2 | 5 |
| TR-DETR: Task-Reciprocal Transformer for Joint Moment Retrieval and Highlight Detection | Jan 4, 2024 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning | Oct 25, 2024 | EgoSchema, Hallucination | Code Available | 2 | 5 |
| TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Dec 4, 2023 | Dense Captioning, Highlight Detection | Code Available | 2 | 5 |
| Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | Mar 24, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| Number it: Temporal Grounding Videos like Flipping Manga | Nov 15, 2024 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval | Jul 21, 2024 | General Knowledge, Highlight Detection | Code Available | 2 | 5 |
| Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection | Jan 5, 2025 | Contrastive Learning, Highlight Detection | Code Available | 1 | 5 |
| Adaptive Video Highlight Detection by Learning from User History | Jul 19, 2020 | Highlight Detection | Code Available | 1 | 5 |
| Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detection | Nov 28, 2023 | Contrastive Learning, Highlight Detection | Code Available | 1 | 5 |
| Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies | Mar 26, 2023 | Highlight Detection, Learning with noisy labels | Code Available | 1 | 5 |
| Cross-category Video Highlight Detection via Set-based Learning | Aug 26, 2021 | Domain Adaptation, Highlight Detection | Code Available | 1 | 5 |
| FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding | Dec 18, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 | 5 |
| Joint Moment Retrieval and Highlight Detection Via Natural Language Queries | May 8, 2023 | Decoder, Highlight Detection | Code Available | 1 | 5 |
| LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection | Jan 18, 2025 | Contrastive Learning, Decoder | Code Available | 1 | 5 |
| M2-Net: Multi-stages Specular Highlight Detection and Removal in Multi-scenes | Jul 20, 2022 | Highlight Detection, highlight removal | Code Available | 1 | 5 |
| MH-DETR: Video Moment and Highlight Detection with Cross-modal Transformer | Apr 29, 2023 | Decoder, Highlight Detection | Code Available | 1 | 5 |
| PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation | Apr 18, 2018 | Highlight Detection | Code Available | 1 | 5 |
| QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries | Jul 20, 2021 | Highlight Detection, Moment Retrieval | Code Available | 1 | 5 |
| Saliency-Guided DETR for Moment Retrieval and Highlight Detection | Oct 2, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 | 5 |
| Single-Image Specular Highlight Removal via Real-World Dataset Construction | Aug 27, 2021 | Generative Adversarial Network, Highlight Detection | Code Available | 1 | 5 |
| SpecSeg Network for Specular Highlight Detection and Segmentation in Real-World Images | Aug 30, 2022 | Highlight Detection, Specular Segmentation | Code Available | 1 | 5 |
| Task-Driven Exploration: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection | Apr 14, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 | 5 |
| TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action | May 2, 2025 | Dense Captioning, Highlight Detection | Code Available | 1 | 5 |
| Text-Aware Single Image Specular Highlight Removal | Aug 16, 2021 | Highlight Detection, highlight removal | Code Available | 1 | 5 |
| VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval | Dec 2, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 | 5 |
| VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format | Nov 27, 2024 | Dense Video Captioning, Grounded Video Question Answering | Code Available | 1 | 5 |
| Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark | Dec 12, 2024 | Highlight Detection, Video Summarization | Code Available | 1 | 5 |
| SoccerDB: A Large-Scale Database for Comprehensive Video Understanding | Dec 10, 2019 | Action Classification, Action Detection | Code Available | 0 | 5 |
| R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding | Apr 2, 2024 | Highlight Detection, Moment Retrieval | Code Available | 0 | 5 |
| Single Image Highlight Removal with a Sparse and Low-Rank Reflection Model | Sep 1, 2018 | Highlight Detection, highlight removal | Code Available | 0 | 5 |
| R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding | Mar 31, 2024 | Highlight Detection, Moment Retrieval | Code Available | 0 | 5 |
| AENet: Learning Deep Audio Features for Video Analysis | Jan 3, 2017 | Action Recognition, Data Augmentation | Code Available | 0 | 5 |
| Joint network for specular highlight detection and adversarial generation of specular-free images trained with polarimetric data | Nov 28, 2023 | Decision Making, Generative Adversarial Network | Code Available | 0 | 5 |
| Rhapsody: A Dataset for Highlight Detection in Podcasts | May 26, 2025 | Binary Classification, Highlight Detection | Code Available | 0 | 5 |
| Video Highlights Detection and Summarization with Lag-Calibration based on Concept-Emotion Mapping of Crowd-sourced Time-Sync Comments | Aug 7, 2017 | Highlight Detection | Code Available | 0 | 5 |
| Unleash the Potential of CLIP for Video Highlight Detection | Apr 2, 2024 | Highlight Detection | Code Available | 0 | 5 |
| PR-Net: Preference Reasoning for Personalized Video Highlight Detection | Sep 4, 2021 | Highlight Detection, Semantic Similarity | Unverified | 0 | 0 |
| Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive Learning | Jun 21, 2022 | Contrastive Learning, Highlight Detection | Unverified | 0 | 0 |
| AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism | Jun 10, 2022 | Highlight Detection | Unverified | 0 | 0 |
| Unsupervised Transcript-assisted Video Summarization and Highlight Detection | May 29, 2025 | Highlight Detection, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Unsupervised Video Highlight Detection by Learning from Audio and Visual Recurrence | Jul 18, 2024 | Highlight Detection | Unverified | 0 | 0 |
| Show Me What I Like: Detecting User-Specific Video Highlights Using Content-Based Multi-Head Attention | Jul 18, 2022 | Highlight Detection | Unverified | 0 | 0 |
| Video Highlights Detection and Summarization with Lag-Calibration based on Concept-Emotion Mapping of Crowdsourced Time-Sync Comments | Sep 1, 2017 | Highlight Detection | Unverified | 0 | 0 |
| Smart Director: An Event-Driven Directing System for Live Broadcasting | Jan 11, 2022 | Event Detection, Highlight Detection | Unverified | 0 | 0 |
| 0/1 Deep Neural Networks via Block Coordinate Descent | Jun 19, 2022 | 10-shot image generation | Unverified | 0 | 0 |