Malicious or Benign? Towards Effective Content Moderation for Children's Videos May 24, 2023 Video Classification
Code Code Available 0Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception May 10, 2023 Classification image-classification
— Unverified 0HateMM: A Multi-Modal Dataset for Hate Video Classification May 6, 2023 Classification Hate Speech Detection
Code Code Available 1Verbs in Action: Improving verb understanding in video-language models Apr 13, 2023 Contrastive Learning Question Answering
Code Code Available 0SparseFormer: Sparse Visual Recognition via Limited Latent Tokens Apr 7, 2023 Image Classification Sparse Representation-based Classification
Code Code Available 1Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting Apr 6, 2023 Action Recognition Prompt Learning
Code Code Available 1NetFlick: Adversarial Flickering Attacks on Deep Learning Based Video Compression Apr 4, 2023 Deep Learning Video Classification
— Unverified 0SELF-VS: Self-supervised Encoding Learning For Video Summarization Mar 28, 2023 Knowledge Distillation Representation Learning
Code Code Available 0Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling Mar 27, 2023 Action Localization Action Recognition
— Unverified 0Selective Structured State-Spaces for Long-Form Video Understanding Mar 25, 2023 Contrastive Learning Form
— Unverified 0The effectiveness of MAE pre-pretraining for billion-scale pretraining Mar 23, 2023 Action Classification Action Recognition
Code Code Available 1ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders Mar 21, 2023 Action Classification Action Recognition
Code Code Available 0Towards Activated Muscle Group Estimation in the Wild Mar 2, 2023 Activity Recognition Human Activity Recognition
Code Code Available 1Temporal Coherent Test-Time Optimization for Robust Video Classification Feb 28, 2023 Classification Self-Supervised Learning
— Unverified 0Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks Feb 24, 2023 Classification Data Augmentation
— Unverified 0Analysis of Real-Time Hostile Activitiy Detection from Spatiotemporal Features Using Time Distributed Deep CNNs, RNNs and Attention-Based Mechanisms Feb 21, 2023 Action Recognition Classification
— Unverified 0Reversible Vision Transformers Feb 9, 2023 GPU image-classification
Code Code Available 1Augmenting Ego-Vehicle for Traffic Near-Miss and Accident Classification Dataset using Manipulating Conditional Style Translation Jan 6, 2023 Image-to-Image Translation Translation
Code Code Available 0Few-Shot Video Classification via Representation Fusion and Promotion Learning Jan 1, 2023 Video Classification
— Unverified 0Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos Jan 1, 2023 Contrastive Learning Math
— Unverified 0Efficient Movie Scene Detection using State-Space Transformers Dec 29, 2022 GPU Scene Segmentation
Code Code Available 1Truncate-Split-Contrast: A Framework for Learning from Mislabeled Videos Dec 27, 2022 channel selection Contrastive Learning
— Unverified 0VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners Dec 9, 2022 Question Answering Retrieval
— Unverified 0Evaluation of Explanation Methods of AI -- CNNs in Image Classification Tasks with Reference-based and No-reference Metrics Dec 2, 2022 image-classification Image Classification
Code Code Available 0A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition Nov 16, 2022 Action Recognition Data Augmentation
Code Code Available 1Deep Unsupervised Key Frame Extraction for Efficient Video Classification Nov 12, 2022 Classification Video Classification
— Unverified 0BOREx: Bayesian-Optimization--Based Refinement of Saliency Map for Image- and Video-Classification Models Oct 31, 2022 Bayesian Optimization Classification
— Unverified 0Transfer-learning for video classification: Video Swin Transformer on multiple domains Oct 18, 2022 Transfer Learning Video Classification
— Unverified 0Linear Video Transformer with Feature Fixation Oct 15, 2022 Feature Importance Video Classification
— Unverified 0Overlooked Video Classification in Weakly Supervised Video Anomaly Detection Oct 13, 2022 All Anomaly Detection
Code Code Available 1S4ND: Modeling Images and Videos as Multidimensional Signals Using State Spaces Oct 12, 2022 Inductive Bias State Space Models
Code Code Available 0TAD: A Large-Scale Benchmark for Traffic Accidents Detection from Video Surveillance Sep 26, 2022 image-classification Image Classification
Code Code Available 1FuTH-Net: Fusing Temporal Relations and Holistic Features for Aerial Video Classification Sep 22, 2022 Action Recognition Temporal Action Localization
— Unverified 0Traffic Congestion Prediction using Deep Convolutional Neural Networks: A Color-coding Approach Sep 16, 2022 Classification vehicle detection
— Unverified 0On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition Sep 15, 2022 image-classification Image Classification
— Unverified 0UAV-CROWD: Violent and non-violent crowd activity simulator from the perspective of UAV Aug 13, 2022 Semantic Segmentation Video Classification
— Unverified 0Motion Sensitive Contrastive Learning for Self-supervised Video Representation Aug 12, 2022 Contrastive Learning Representation Learning
— Unverified 0Two-Stream Transformer Architecture for Long Video Understanding Aug 2, 2022 Action Recognition GPU
— Unverified 0Adaptive occlusion sensitivity analysis for visually explaining video recognition networks Jul 26, 2022 Decision Making image-classification
Code Code Available 0P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos Jul 26, 2022 Action Detection Action Localization
— Unverified 0SSIVD-Net: A Novel Salient Super Image Classification & Detection Technique for Weaponized Violence Jul 26, 2022 Action Recognition image-classification
Code Code Available 1Intelligent 3D Network Protocol for Multimedia Data Classification using Deep Learning Jul 23, 2022 Action Recognition Deep Learning
— Unverified 0Contrastive Self-Supervised Learning Leads to Higher Adversarial Susceptibility Jul 22, 2022 Adversarial Robustness Self-Supervised Learning
— Unverified 0Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments Jul 21, 2022 General Classification Video Classification
Code Code Available 1NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition Jul 21, 2022 Action Recognition Video Classification
— Unverified 0Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset Jul 21, 2022 Fine-Grained Visual Categorization Video Classification
Code Code Available 1Temporal and cross-modal attention for audio-visual zero-shot learning Jul 20, 2022 GZSL Video Classification Video Classification
Code Code Available 1GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning Jul 20, 2022 Action Recognition Clustering
Code Code Available 0Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models Jul 15, 2022 Optical Flow Estimation Video Classification
— Unverified 0Long-term Leap Attention, Short-term Periodic Shift for Video Classification Jul 12, 2022 Video Classification
Code Code Available 0