SOTAVerified

Activity Recognition

Human Activity Recognition is the problem of identifying events performed by humans given a video input. It is formulated as a binary (or multiclass) classification problem of outputting activity class labels. Activity Recognition is an important problem with many societal applications including smart surveillance, video search/retrieval, intelligent robots, and other monitoring systems.

Source: Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters

Papers

Showing 951975 of 1322 papers

TitleStatusHype
Deep Learning for Inertial Sensor Alignment0
Mouse Movement and Probabilistic Graphical Models Based E-Learning Activity Recognition Improvement Possibilistic Model0
MPT-PAR:Mix-Parameters Transformer for Panoramic Activity Recognition0
MuJo: Multimodal Joint Feature Space Learning for Human Activity Recognition0
Multi-agent Attentional Activity Recognition0
Multi-channel Time Series Decomposition Network For Generalizable Sensor-Based Activity Recognition0
Multi-Channel Time-Series Person and Soft-Biometric Identification0
MultiCore+TPU Accelerated Multi-Modal TinyML for Livestock Behaviour Recognition0
Multidimensional Human Activity Recognition With Large Language Model: A Conceptual Framework0
Multi-GAT: A Graphical Attention-based Hierarchical Multimodal Representation Learning Approach for Human Activity Recognition0
Multi-kernel learning of deep convolutional features for action recognition0
Multi-Label Activity Recognition using Activity-specific Features and Activity Correlations0
Multi-label Class-imbalanced Action Recognition in Hockey Videos via 3D Convolutional Neural Networks0
Multi-label Prediction in Time Series Data using Deep Neural Networks0
Multi-level Contrast Network for Wearables-based Joint Activity Segmentation and Recognition0
Multi-Level Sequence GAN for Group Activity Recognition0
Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition0
Multi-modal Egocentric Activity Recognition using Audio-Visual Features0
Multi-Modal Gesture Recognition from Video and Surgical Tool Pose Information via Motion Invariants0
Multi-Modal Prototype Learning for Interpretable Multivariable Time Series Classification0
Multi-Modal Unsupervised Pre-Training for Surgical Operating Room Workflow Analysis0
Multi-objective Feature Selection in Remote Health Monitoring Applications0
Multi-Person Brain Activity Recognition via Comprehensive EEG Signal Analysis0
Multiple Human Association between Top and Horizontal Views by Matching Subjects' Spatial Distributions0
Multiple object tracking with context awareness0
Show:102550
← PrevPage 39 of 53Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Structured Keypoint PoolingAccuracy93.4Unverified
2Semi-Supervised Hard Attention (SSHA); pretrained on Deepmind Kinetics datasetAccuracy90.4Unverified
3Human Skeletons + Change DetectionAccuracy90.25Unverified
4Separable Convolutional LSTMAccuracy89.75Unverified
5SPIL ConvolutionAccuracy89.3Unverified
6Flow Gated NetworkAccuracy87.25Unverified
#ModelMetricClaimedVerifiedStatus
1FocusCLIPTop-3 Accuracy (%)10.47Unverified
2CLIPTop-3 Accuracy (%)6.49Unverified
#ModelMetricClaimedVerifiedStatus
1Boutaleb et al.1:1 Accuracy97.91Unverified
#ModelMetricClaimedVerifiedStatus
1all-landmark-modelActivity Recognition0.76Unverified