| Single-Stage Visual Query Localization in Egocentric Videos | Jun 15, 2023 | Object Detection | Unverified | 0 |
| Self-Chained Image-Language Model for Video Localization and Question Answering | May 11, 2023 | Language Modelling | Code Available | 1 |
| Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations | May 10, 2023 | Template Matching, Temporal Localization | Unverified | 0 |
| Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding | Mar 28, 2023 | Action Localization, Action Recognition | Unverified | 0 |
| VADER: Video Alignment Differencing and Retrieval | Mar 23, 2023 | Misinformation, Retrieval | Unverified | 0 |
| Unsupervised classification to improve the quality of a bird song recording dataset | Feb 15, 2023 | Sound Classification, Temporal Localization | Code Available | 1 |
| Multi-Task Learning of Object State Changes from Uncurated Videos | Nov 24, 2022 | Multi-Task Learning, Object | Code Available | 1 |
| Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022 | Nov 18, 2022 | Object State Change Classification, Temporal Localization | Code Available | 0 |
| Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022 | Nov 16, 2022 | Human-Object Interaction Detection, Object | Unverified | 0 |
| Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection | Oct 18, 2022 | Event Detection, Sound Event Detection | Unverified | 0 |
| Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection | Sep 26, 2022 | Audio Tagging, Event Detection | Unverified | 0 |
| Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022 | Jul 22, 2022 | Object, Object State Change Classification | Unverified | 0 |
| LocVTP: Video-Text Pre-training for Temporal Localization | Jul 21, 2022 | Retrieval, Temporal Localization | Code Available | 1 |
| Team PKU-WICT-MIPL PIC Makeup Temporal Video Grounding Challenge 2022 Technical Report | Jul 6, 2022 | Sentence, Temporal Localization | Unverified | 0 |
| Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes | Jun 16, 2022 | Temporal Localization | Unverified | 0 |
| Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022 | Jun 15, 2022 | Point-of-no-return (PNR) temporal localization, Temporal Localization | Unverified | 0 |
| TadML: A fast temporal action detection with Mechanics-MLP | Jun 7, 2022 | Action Detection, Optical Flow Estimation | Code Available | 0 |
| Egocentric Video-Language Pretraining | Jun 3, 2022 | Action Recognition, Contrastive Learning | Code Available | 2 |
| Stargazer: A transformer-based driver action detection system for intelligent transportation | Jun 1, 2022 | Action Detection, Action Recognition | Code Available | 1 |
| To catch a chorus, verse, intro, or anything else: Analyzing a song with structural functions | May 29, 2022 | Boundary Detection, Temporal Localization | Unverified | 0 |
| Temporally Precise Action Spotting in Soccer Videos Using Dense Detection Anchors | May 20, 2022 | Action Spotting, Data Augmentation | Code Available | 1 |
| Contrastive Language-Action Pre-training for Temporal Localization | Apr 26, 2022 | Action Localization, Contrastive Learning | Unverified | 0 |
| TubeDETR: Spatio-Temporal Video Grounding with Transformers | Mar 30, 2022 | Decoder, Language-Based Temporal Localization | Code Available | 1 |
| Unsupervised Pre-training for Temporal Action Localization Tasks | Mar 25, 2022 | Action Localization, Contrastive Learning | Code Available | 1 |
| OpenTAL: Towards Open Set Temporal Action Localization | Mar 10, 2022 | Action Classification, Action Localization | Code Available | 1 |