ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment Jun 28, 2025 Dynamic Time Warping Large Language Model
Code Code Available 0Exploring Audio Cues for Enhanced Test-Time Video Model Adaptation Jun 14, 2025 Test-time Adaptation Video Classification
Code Code Available 0Spatiotemporal Analysis of Forest Machine Operations Using 3D Video Classification May 30, 2025 Activity Recognition Video Classification
— Unverified 0Video-GPT via Next Clip Diffusion May 18, 2025 Denoising Image Animation
Code Code Available 1Read My Ears! Horse Ear Movement Detection for Equine Affective State Assessment May 6, 2025 Optical Flow Estimation Video Classification
Code Code Available 0Perception Encoder: The best visual embeddings are not at the output of the network Apr 17, 2025 Depth Estimation Language Modeling
Code Code Available 8TenAd: A Tensor-based Low-rank Black Box Adversarial Attack for Video Classification Apr 1, 2025 Adversarial Attack Video Classification
— Unverified 0Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks Mar 24, 2025 Common Sense Reasoning Prediction
— Unverified 0Spatiotemporal Learning with Context-aware Video Tubelets for Ultrasound Video Analysis Mar 21, 2025 object-detection Object Detection
— Unverified 0Ultrasound Image-to-Video Synthesis via Latent Dynamic Diffusion Models Mar 19, 2025 Classification Data Augmentation
Code Code Available 0Accurate and Efficient Two-Stage Gun Detection in Video Mar 8, 2025 Anomaly Detection Object
— Unverified 0Online Meta-learning for AutoML in Real-time (OnMAR) Feb 27, 2025 AutoML Image Clustering
— Unverified 0Variable-frame CNNLSTM for Breast Nodule Classification using Ultrasound Videos Feb 17, 2025 Classification Specificity
— Unverified 0Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering Feb 13, 2025 Classification Prompt Engineering
— Unverified 0Towards a Robust Framework for Multimodal Hate Detection: A Study on Video vs. Image-based Content Feb 11, 2025 Hate Speech Detection Video Classification
Code Code Available 0BRIDLE: Generalized Self-supervised Learning with Quantization Feb 4, 2025 image-classification Image Classification
Code Code Available 0Extending Information Bottleneck Attribution to Video Sequences Jan 28, 2025 DeepFake Detection Face Swapping
Code Code Available 0Document-Level Sentiment Analysis of Urdu Text Using Deep Learning Techniques Jan 23, 2025 Sentiment Analysis Video Classification
— Unverified 0Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor Jan 21, 2025 Diagnostic Knowledge Distillation
Code Code Available 0GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Jan 20, 2025 Video Classification Video Generation
— Unverified 0When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis Jan 17, 2025 Large Language Model Multimodal Large Language Model
Code Code Available 1Towards Zero-Shot & Explainable Video Description by Reasoning over Graphs of Events in Space and Time Jan 14, 2025 Object Recognition Text Generation
— Unverified 0An Empirical Study of Autoregressive Pre-training from Videos Jan 9, 2025 Object Tracking Video Classification
— Unverified 0Temporal Feature Weaving for Neonatal Echocardiographic Viewpoint Video Classification Jan 7, 2025 Classification image-classification
Code Code Available 0CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets Jan 6, 2025 Transfer Learning Video Classification
— Unverified 0Multi-Modal Video Feature Extraction for Popularity Prediction Jan 2, 2025 Feature Engineering Prediction
— Unverified 0AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Jan 1, 2025 GPU Question Answering
— Unverified 0DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification Jan 1, 2025 Action Classification Action Recognition
Code Code Available 0LEARN: A Unified Framework for Multi-Task Domain Adapt Few-Shot Learning Dec 20, 2024 Domain Adaptation Few-Shot Learning
Code Code Available 0Discrepancy-Aware Attention Network for Enhanced Audio-Visual Zero-Shot Learning Dec 16, 2024 Video Classification Zero-Shot Learning
— Unverified 0Gramian Multimodal Representation Learning and Alignment Dec 16, 2024 Contrastive Learning Representation Learning
Code Code Available 2Context-Aware Detection of Mixed Critical Events using Video Classification Nov 24, 2024 Fire Detection Video Classification
— Unverified 0OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions Nov 24, 2024 Action Classification Action Recognition
Code Code Available 0AdaCM^2: On Understanding Extremely Long-Term Video with Adaptive Cross-Modality Memory Reduction Nov 19, 2024 GPU Question Answering
— Unverified 0Efficient Audio-Visual Fusion for Video Classification Nov 8, 2024 Classification Video Classification
— Unverified 0AM Flow: Adapters for Temporal Processing in Action Recognition Nov 4, 2024 Action Classification Action Recognition
— Unverified 0Optimizing Violence Detection in Video Classification Accuracy through 3D Convolutional Neural Networks Nov 2, 2024 Optical Flow Estimation Video Classification
— Unverified 0Video Token Merging for Long-form Video Understanding Oct 31, 2024 Form Video Classification
— Unverified 0Distributed Intelligent Video Surveillance for Early Armed Robbery Detection based on Deep Learning Oct 13, 2024 object-detection Object Detection
Code Code Available 0CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification Oct 3, 2024 Video Classification
— Unverified 0TikGuard: A Deep Learning Transformer-Based Solution for Detecting Unsuitable TikTok Content for Kids Oct 1, 2024 Video Classification
— Unverified 0Benchmarking Edge AI Platforms for High-Performance ML Inference Sep 23, 2024 Benchmarking CPU
— Unverified 0Pushing the boundaries of event subsampling in event-based video classification using CNNs Sep 13, 2024 Event data classification Sensitivity
Code Code Available 0Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space Sep 9, 2024 Video Classification
Code Code Available 0Inference-Scale Complexity in ANN-SNN Conversion for High-Performance and Low-Power Applications Sep 5, 2024 image-classification Image Classification
Code Code Available 0ReSpike: Residual Frames-based Hybrid Spiking Neural Networks for Efficient Action Recognition Sep 3, 2024 Action Recognition image-classification
Code Code Available 0HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics Aug 30, 2024 Form Video Classification
Code Code Available 1Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification Aug 26, 2024 Video Classification Video Understanding
— Unverified 0Query-Efficient Video Adversarial Attack with Stylized Logo Aug 22, 2024 Adversarial Attack Reinforcement Learning (RL)
— Unverified 0VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces Aug 4, 2024 image-classification Image Classification
Code Code Available 0