| Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA | May 13, 2020 | Image Captioning, Multi-Label Classification | Code Available | 1 |
| Finding Moments in Video Collections Using Natural Language | Jul 30, 2019 | Moment Retrieval, Re-Ranking | Code Available | 1 |
| Unsupervised classification to improve the quality of a bird song recording dataset | Feb 15, 2023 | Sound Classification, Temporal Localization | Code Available | 1 |
| Enriching Local and Global Contexts for Temporal Action Localization | Jul 27, 2021 | Action Classification, Action Localization | Code Available | 1 |
| OpenTAL: Towards Open Set Temporal Action Localization | Mar 10, 2022 | Action Classification, Action Localization | Code Available | 1 |
| Self-Chained Image-Language Model for Video Localization and Question Answering | May 11, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Audio-Visual Event Localization in Unconstrained Videos | Mar 23, 2018 | audio-visual event localization, Temporal Localization | Code Available | 1 |
| Multi-Task Learning of Object State Changes from Uncurated Videos | Nov 24, 2022 | Multi-Task Learning, Object | Code Available | 1 |
| MAC: Mining Activity Concepts for Language-based Temporal Localization | Nov 21, 2018 | Language-Based Temporal Localization, Temporal Localization | Code Available | 1 |
| DisTime: Distribution-based Time Representation for Video Large Language Models | May 30, 2025 | Temporal Localization, Video Understanding | Code Available | 1 |
| End-to-End Semi-Supervised Learning for Video Action Detection | Mar 8, 2022 | Action Detection, Classification Consistency | Code Available | 1 |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Jul 1, 2024 | AUDIO-VISUAL QUESTION ANSWERING (MUSIC-AVQA-v2.0), Fact Checking | Code Available | 1 |
| Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding in Videos | Jan 25, 2022 | Natural Language Queries, Sentence | Code Available | 1 |
| Stargazer: A transformer-based driver action detection system for intelligent transportation | Jun 1, 2022 | Action Detection, Action Recognition | Code Available | 1 |
| TALL: Temporal Activity Localization via Language Query | May 5, 2017 | Natural Language Queries, regression | Code Available | 1 |
| Few-Shot Temporal Action Localization with Query Adaptive Transformer | Oct 20, 2021 | Action Localization, Action Segmentation | Code Available | 1 |
| CityFlow-NL: Tracking and Retrieval of Vehicles at City Scale by Natural Language Descriptions | Jan 12, 2021 | Multi-Object Tracking, Object Tracking | Code Available | 1 |
| Unsupervised Pre-training for Temporal Action Localization Tasks | Mar 25, 2022 | Action Localization, Contrastive Learning | Code Available | 1 |
| TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos | Mar 9, 2025 | Action Localization, Boundary Detection | Code Available | 1 |
| Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter | Sep 28, 2024 | Temporal Localization | Unverified | 0 |
| Described Spatial-Temporal Video Detection | Jul 8, 2024 | Multi-class Classification, Temporal Localization | Unverified | 0 |
| Density-Guided Label Smoothing for Temporal Localization of Driving Actions | Mar 11, 2024 | Action Localization, Action Recognition | Unverified | 0 |
| Action Shuffling for Weakly Supervised Temporal Localization | May 10, 2021 | Action Localization, Temporal Localization | Unverified | 0 |
| Learning to track for spatio-temporal action localization | Jun 5, 2015 | Action Localization, Spatio-Temporal Action Localization | Unverified | 0 |
| Deep-Learning-Assisted Analysis of Cataract Surgery Videos | Dec 10, 2023 | Decision Making, Deep Learning | Unverified | 0 |