| Finding Moments in Video Collections Using Natural Language | Jul 30, 2019 | Moment RetrievalRe-Ranking | CodeCode Available | 1 |
| MAC: Mining Activity Concepts for Language-based Temporal Localization | Nov 21, 2018 | Language-Based Temporal LocalizationTemporal Localization | CodeCode Available | 1 |
| Audio-Visual Event Localization in Unconstrained Videos | Mar 23, 2018 | audio-visual event localizationTemporal Localization | CodeCode Available | 1 |
| TALL: Temporal Activity Localization via Language Query | May 5, 2017 | Natural Language Queriesregression | CodeCode Available | 1 |
| Fine-Tuning Large Audio-Language Models with LoRA for Precise Temporal Localization of Prolonged Exposure Therapy Elements | Jun 11, 2025 | Temporal Localization | —Unverified | 0 |
| Transforming faces into video stories -- VideoFace2.0 | May 4, 2025 | Face DetectionFace Recognition | CodeCode Available | 0 |
| TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | Apr 24, 2025 | Caption GenerationDense Video Captioning | —Unverified | 0 |
| Hierarchical and Multimodal Data for Daily Activity Understanding | Apr 24, 2025 | Action Anticipationcounterfactual | CodeCode Available | 0 |
| A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Crash Time Matters: HybridMamba for Fine-Grained Temporal Localization in Traffic Surveillance Footage | Apr 4, 2025 | Temporal Localization | —Unverified | 0 |