A Simple Video Segmenter by Tracking Objects Along Axial Trajectories Nov 30, 2023 GPU Object
The effectiveness of MAE pre-pretraining for billion-scale pretraining Mar 23, 2023 Action Classification Action Recognition
Learning Implicit Temporal Alignment for Few-shot Video Classification May 11, 2021 Action Recognition In Videos Classification
SSIVD-Net: A Novel Salient Super Image Classification & Detection Technique for Weaponized Violence Jul 26, 2022 Action Recognition image-classification
Large-Scale Video Classification with Convolutional Neural Networks Jun 23, 2014 Action Recognition Classification
A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark Oct 24, 2021 Classification Meta-Learning
A Multigrid Method for Efficiently Training Video Models Dec 2, 2019 Action Detection Action Recognition
Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks Nov 28, 2017 Action Recognition Philosophy
A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition Nov 16, 2022 Action Recognition Data Augmentation
A Unified Taxonomy and Multimodal Dataset for Events in Invasion Games Aug 25, 2021 Benchmarking Video Classification
Progressive Video Summarization via Multimodal Self-supervised Learning Jan 7, 2022 Self-Supervised Learning Supervised Video Summarization
Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior Mar 17, 2020 Adversarial Attack Video Classification
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks Dec 21, 2023 Image Retrieval Image-to-Text Retrieval
Large Scale Holistic Video Understanding Apr 25, 2019 Action Classification Action Recognition
A Dataset for Medical Instructional Video Classification and Question Answering Jan 30, 2022 Classification Question Answering
Home Action Genome: Cooperative Compositional Action Understanding May 11, 2021 Action Recognition Action Understanding
Is normalization indispensable for training deep neural network? Dec 1, 2020 General Classification image-classification
Generalized Few-Shot Video Classification with Video Retrieval and Feature Generation Jul 9, 2020 Few-Shot Image Classification Few-Shot Learning
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection Dec 2, 2021 Action Classification Action Recognition
Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments Jul 21, 2022 General Classification Video Classification
Billion-scale semi-supervised learning for image classification May 2, 2019 Classification General Classification
Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound Videos Jun 27, 2022 Video Classification
Active Contrastive Learning of Audio-Visual Video Representations Aug 31, 2020 Contrastive Learning Representation Learning
EEG-based Emotional Video Classification via Learning Connectivity Structure May 28, 2019 Classification EEG
HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics Aug 30, 2024 Form Video Classification
Learning To Recognize Procedural Activities with Distant Supervision Jan 26, 2022 Action Classification Language Modelling
ViViT: A Video Vision Transformer Mar 29, 2021 Action Classification Action Recognition
Making a Case for 3D Convolutions for Object Segmentation in Videos Aug 26, 2020 Decoder Segmentation
Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification Dec 1, 2020 3D Architecture Action Recognition
MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili Jul 28, 2024 Hate Speech Detection Video Classification
Approximated Bilinear Modules for Temporal Modeling Jul 25, 2020 Action Recognition Video Classification
MUVF-YOLOX: A Multi-modal Ultrasound Video Fusion Network for Renal Tumor Diagnosis Jul 15, 2023 Video Classification
AdaPool: Exponential Adaptive Pooling for Information-Retaining Downsampling Nov 1, 2021 Benchmarking object-detection
On the effectiveness of task granularity for transfer learning Apr 24, 2018 Classification Diversity
Compact Generalized Non-local Network Oct 31, 2018 Object Detection Object Recognition
Out-of-Distribution Detection Using Union of 1-Dimensional Subspaces Jun 19, 2021 Bayesian Inference Out-of-Distribution Detection
Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset Jul 21, 2022 Fine-Grained Visual Categorization Video Classification
Piano Skills Assessment Jan 13, 2021 Action Quality Assessment Audio Classification
Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization May 1, 2022 Action Localization Data Augmentation
Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition Jun 20, 2020 Action Classification Action Recognition
Convolutional Spiking Neural Networks for Spatio-Temporal Feature Extraction Mar 27, 2020 Activity Recognition In Videos Event data classification
Reinforcement Learning with Latent Flow Jan 6, 2021 Atari Games continuous-control
HateMM: A Multi-Modal Dataset for Hate Video Classification May 6, 2023 Classification Hate Speech Detection
Reversible Vision Transformers Feb 9, 2023 GPU image-classification
MotionSqueeze: Neural Motion Feature Learning for Video Understanding Jul 20, 2020 Action Classification Action Recognition
Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting Jun 18, 2021 Action Recognition Action Recognition In Videos
Adaptive Token Sampling For Efficient Vision Transformers Nov 30, 2021 Efficient ViTs image-classification
SmallBigNet: Integrating Core and Contextual Views for Video Classification Jun 25, 2020 Classification General Classification
Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification Mar 29, 2022 Representation Learning Video Classification
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization Mar 28, 2024 Video Classification Zero-Shot Learning