| Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study | Dec 29, 2024 | Motion Detection, Optical Character Recognition | Code Available | 0 |
| Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images | Apr 4, 2015 | Action Localization, Action Recognition | Code Available | 0 |
| RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization | Mar 30, 2019 | Action Localization, Temporal Action Localization | Code Available | 0 |
| Technical Report of the Video Event Reconstruction and Analysis (VERA) System -- Shooter Localization, Models, Interface, and Beyond | May 26, 2019 | Gunshot Detection, Shooter Localization | Code Available | 0 |
| Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks | Apr 19, 2016 | Action Detection, Action Recognition | Code Available | 0 |
| Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis | Sep 13, 2020 | Diagnostic, Sensitivity | Code Available | 0 |
| NAAQA: A Neural Architecture for Acoustic Question Answering | Jun 11, 2021 | Acoustic Question Answering, Question Answering | Code Available | 0 |
| Multi-attention Networks for Temporal Localization of Video-level Labels | Nov 15, 2019 | Action Recognition, Temporal Action Localization | Code Available | 0 |
| Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022 | Nov 18, 2022 | Object State Change Classification, Temporal Localization | Code Available | 0 |
| TimeRefine: Temporal Grounding with Time Refining Video LLM | Dec 12, 2024 | Temporal Localization | Code Available | 0 |