| Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding | Mar 24, 2024 | Dense Video CaptioningTemporal Localization | —Unverified | 0 | 0 |
| Transductive Universal Transport for Zero-Shot Action Recognition | Sep 29, 2021 | Action RecognitionObject | —Unverified | 0 | 0 |
| Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition | Mar 11, 2024 | 2D Human Pose EstimationAction Recognition | —Unverified | 0 | 0 |
| Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization | Dec 1, 2013 | Action LocalizationClassification | —Unverified | 0 | 0 |
| Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations | May 10, 2023 | Template MatchingTemporal Localization | —Unverified | 0 | 0 |
| A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Universal Prototype Transport for Zero-Shot Action Recognition and Localization | Mar 8, 2022 | Action RecognitionObject | —Unverified | 0 | 0 |
| What do I Annotate Next? An Empirical Study of Active Learning for Action Localization | Sep 1, 2018 | Action LocalizationActive Learning | —Unverified | 0 | 0 |
| Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences | Jan 29, 2020 | Action RecognitionAction Segmentation | —Unverified | 0 | 0 |
| A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus | Nov 18, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Learning to track for spatio-temporal action localization | Jun 5, 2015 | Action LocalizationSpatio-Temporal Action Localization | —Unverified | 0 | 0 |
| Inceptive Event Time-Surfaces for Object Classification Using Neuromorphic Cameras | Feb 26, 2020 | ClassificationDimensionality Reduction | —Unverified | 0 | 0 |
| Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection | Sep 26, 2022 | Audio TaggingEvent Detection | —Unverified | 0 | 0 |
| Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors | Aug 27, 2024 | Event DetectionSound Event Detection | —Unverified | 0 | 0 |
| Identity-aware Graph Memory Network for Action Detection | Aug 26, 2021 | Action DetectionGraph Neural Network | —Unverified | 0 | 0 |
| Unsupervised detection and classification of heartbeats using the dissimilarity matrix in PCG signals | Nov 5, 2024 | Heart SegmentationSound Classification | —Unverified | 0 | 0 |
| Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization | Mar 12, 2025 | Temporal LocalizationVideo Understanding | —Unverified | 0 | 0 |
| Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome | Jan 25, 2025 | DiagnosticSleep Staging | —Unverified | 0 | 0 |
| Fine-Tuning Large Audio-Language Models with LoRA for Precise Temporal Localization of Prolonged Exposure Therapy Elements | Jun 11, 2025 | Temporal Localization | —Unverified | 0 | 0 |
| MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval | Jun 25, 2024 | cross-modal alignmentMoment Retrieval | —Unverified | 0 | 0 |
| Modality Shifting Attention Network for Multi-modal Video Question Answering | Jul 4, 2020 | Question AnsweringTemporal Localization | —Unverified | 0 | 0 |
| Modeling Spatio-Temporal Human Track Structure for Action Localization | Jun 28, 2018 | Action LocalizationHuman Detection | —Unverified | 0 | 0 |
| A Data Driven End-to-end Approach for In-the-wild Monitoring of Eating Behavior Using Smartwatches | Oct 12, 2020 | Temporal Localization | —Unverified | 0 | 0 |
| Few-Shot Transformation of Common Actions into Time and Space | Apr 6, 2021 | Action LocalizationDecoder | —Unverified | 0 | 0 |
| VADER: Video Alignment Differencing and Retrieval | Mar 23, 2023 | MisinformationRetrieval | —Unverified | 0 | 0 |