| Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection | Sep 26, 2022 | Audio TaggingEvent Detection | —Unverified | 0 |
| Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022 | Jul 22, 2022 | ObjectObject State Change Classification | —Unverified | 0 |
| LocVTP: Video-Text Pre-training for Temporal Localization | Jul 21, 2022 | RetrievalTemporal Localization | CodeCode Available | 1 |
| Team PKU-WICT-MIPL PIC Makeup Temporal Video Grounding Challenge 2022 Technical Report | Jul 6, 2022 | SentenceTemporal Localization | —Unverified | 0 |
| Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes | Jun 16, 2022 | Temporal Localization | —Unverified | 0 |
| Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022 | Jun 15, 2022 | Point- of-no-return (PNR) temporal localizationTemporal Localization | —Unverified | 0 |
| TadML: A fast temporal action detection with Mechanics-MLP | Jun 7, 2022 | Action DetectionOptical Flow Estimation | CodeCode Available | 0 |
| Egocentric Video-Language Pretraining | Jun 3, 2022 | Action RecognitionContrastive Learning | CodeCode Available | 2 |
| Stargazer: A transformer-based driver action detection system for intelligent transportation | Jun 1, 2022 | Action DetectionAction Recognition | CodeCode Available | 1 |
| To catch a chorus, verse, intro, or anything else: Analyzing a song with structural functions | May 29, 2022 | Boundary DetectionTemporal Localization | —Unverified | 0 |