HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics Aug 30, 2024 Form Video Classification
Code Code Available 15 Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound Videos Jun 27, 2022 Video Classification
Code Code Available 15 Out-of-Distribution Detection Using Union of 1-Dimensional Subspaces Jun 19, 2021 Bayesian Inference Out-of-Distribution Detection
Code Code Available 15 On the effectiveness of task granularity for transfer learning Apr 24, 2018 Classification Diversity
Code Code Available 15 Non-local Neural Networks Nov 21, 2017 Action Classification Action Recognition
Code Code Available 15 A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark Oct 24, 2021 Classification Meta-Learning
Code Code Available 15 A Multigrid Method for Efficiently Training Video Models Dec 2, 2019 Action Detection Action Recognition
Code Code Available 15 Learning Implicit Temporal Alignment for Few-shot Video Classification May 11, 2021 Action Recognition In Videos Classification
Code Code Available 15 A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition Nov 16, 2022 Action Recognition Data Augmentation
Code Code Available 15 A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games Aug 25, 2021 Benchmarking Video Classification
Code Code Available 15 On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location Mar 16, 2020 General Classification image-classification
Code Code Available 15 Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks Nov 28, 2017 Action Recognition Philosophy
Code Code Available 15 Piano Skills Assessment Jan 13, 2021 Action Quality Assessment Audio Classification
Code Code Available 15 MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili Jul 28, 2024 Hate Speech Detection Video Classification
Code Code Available 15 A Simple Video Segmenter by Tracking Objects Along Axial Trajectories Nov 30, 2023 GPU Object
Code Code Available 15 A Dataset for Medical Instructional Video Classification and Question Answering Jan 30, 2022 Classification Question Answering
Code Code Available 15 Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior Mar 17, 2020 Adversarial Attack Video Classification
Code Code Available 15 Towards Activated Muscle Group Estimation in the Wild Mar 2, 2023 Activity Recognition Human Activity Recognition
Code Code Available 15 MotionSqueeze: Neural Motion Feature Learning for Video Understanding Jul 20, 2020 Action Classification Action Recognition
Code Code Available 15 Long Movie Clip Classification with State-Space Video Models Apr 4, 2022 Classification Decoder
Code Code Available 15 Billion-scale semi-supervised learning for image classification May 2, 2019 Classification General Classification
Code Code Available 15 Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks Sep 17, 2020 Inductive Bias Object
Code Code Available 15 Non-Local Neural Networks With Grouped Bilinear Attentional Transforms Jun 1, 2020 Image Classification Video Classification
Code Code Available 15 EEG-based Emotional Video Classification via Learning Connectivity Structure May 28, 2019 Classification EEG
Code Code Available 15 ViViT: A Video Vision Transformer Mar 29, 2021 Action Classification Action Recognition
Code Code Available 15 Efficient Movie Scene Detection using State-Space Transformers Dec 29, 2022 GPU Scene Segmentation
Code Code Available 15 Overlooked Video Classification in Weakly Supervised Video Anomaly Detection Oct 13, 2022 All Anomaly Detection
Code Code Available 15 Making a Case for 3D Convolutions for Object Segmentation in Videos Aug 26, 2020 Decoder Segmentation
Code Code Available 15 Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset Jul 21, 2022 Fine-Grained Visual Categorization Video Classification
Code Code Available 15 Home Action Genome: Cooperative Compositional Action Understanding May 11, 2021 Action Recognition Action Understanding
Code Code Available 15 Approximated Bilinear Modules for Temporal Modeling Jul 25, 2020 Action Recognition Video Classification
Code Code Available 15 Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments Nov 10, 2023 Activity Recognition Autonomous Driving
Code Code Available 15 A Spatio-temporal Attention-based Model for Infant Movement Assessment from Videos May 20, 2021 Video Classification
Code Code Available 15 Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications Mar 3, 2020 Benchmarking General Classification
Code Code Available 15 Compact Generalized Non-local Network Oct 31, 2018 Object Detection Object Recognition
Code Code Available 15 Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach Dec 21, 2023 Action Localization Classification
Code Code Available 15 Deep Temporal Linear Encoding Networks Nov 21, 2016 Representation Learning Video Classification
Code Code Available 15 Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting Jun 18, 2021 Action Recognition Action Recognition In Videos
Code Code Available 15 Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization May 1, 2022 Action Localization Data Augmentation
Code Code Available 15 Generalized Few-Shot Video Classification with Video Retrieval and Feature Generation Jul 9, 2020 Few-Shot Image Classification Few-Shot Learning
Code Code Available 15 Convolutional Spiking Neural Networks for Spatio-Temporal Feature Extraction Mar 27, 2020 Activity Recognition In Videos Event data classification
Code Code Available 15 AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling Nov 1, 2021 Benchmarking object-detection
Code Code Available 15 Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification Dec 1, 2020 3D Architecture Action Recognition
Code Code Available 15 HateMM: A Multi-Modal Dataset for Hate Video Classification May 6, 2023 Classification Hate Speech Detection
Code Code Available 15 MUVF-YOLOX: A Multi-modal Ultrasound Video Fusion Network for Renal Tumor Diagnosis Jul 15, 2023 Video Classification
Code Code Available 15 PipeNet: Selective Modal Pipeline of Fusion Network for Multi-Modal Face Anti-Spoofing Apr 24, 2020 Face Anti-Spoofing General Classification
Code Code Available 15 Timeception for Complex Action Recognition Dec 4, 2018 Action Classification Action Recognition
Code Code Available 15 MViTv2: Improved Multiscale Vision Transformers for Classification and Detection Dec 2, 2021 Action Classification Action Recognition
Code Code Available 15 Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification Mar 29, 2022 Representation Learning Video Classification
Code Code Available 15 X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization Mar 28, 2024 Video Classification Zero-Shot Learning
Code Code Available 15