SOTAVerified

Activity Recognition

Human Activity Recognition is the problem of identifying events performed by humans given a video input. It is formulated as a binary (or multiclass) classification problem of outputting activity class labels. Activity Recognition is an important problem with many societal applications including smart surveillance, video search/retrieval, intelligent robots, and other monitoring systems.

Source: Learning Latent Sub-events in Activity Videos Using Temporal Attention Filters

Papers

Showing 651675 of 1322 papers

TitleStatusHype
Student Activity Recognition in Classroom Environments using Transfer Learning0
Super Normal Vector for Activity Recognition Using Depth Sequences0
Surgical Activity Recognition in Robot-Assisted Radical Prostatectomy using Deep Learning0
SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery0
SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition0
Tapestry of Time and Actions: Modeling Human Activity Sequences using Temporal Point Process Flows0
TASKED: Transformer-based Adversarial learning for human activity recognition using wearable sensors via Self-KnowledgE Distillation0
WiFi-TCN: Temporal Convolution for Human Interaction Recognition based on WiFi signal0
Template co-updating in multi-modal human activity recognition systems0
Temporal Action Segmentation with High-level Complex Activity Labels0
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction0
Temporally Consistent Dynamic Scene Graphs: An End-to-End Approach for Action Tracklet Generation0
Temporally Robust Global Motion Compensation by Keypoint-based Congealing0
Temporal Reasoning Graph for Activity Recognition0
TENT: Connect Language Models with IoT Sensors for Zero-Shot Activity Recognition0
Text me the data: Generating Ground Pressure Sequence from Textual Descriptions for HAR0
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes0
The ActivityNet Large-Scale Activity Recognition Challenge 2018 Summary0
The Evolution of First Person Vision Methods: A Survey0
The Potential of Vision-Language Models for Content Moderation of Children's Videos0
The Sweet-Home speech and multimodal corpus for home automation interaction0
Three Branches: Detecting Actions With Richer Features0
Three-Stream Fusion Network for First-Person Interaction Recognition0
Three-stream network for enriched Action Recognition0
Time Series Segmentation through Automatic Feature Learning0
Show:102550
← PrevPage 27 of 53Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Structured Keypoint PoolingAccuracy93.4Unverified
2Semi-Supervised Hard Attention (SSHA); pretrained on Deepmind Kinetics datasetAccuracy90.4Unverified
3Human Skeletons + Change DetectionAccuracy90.25Unverified
4Separable Convolutional LSTMAccuracy89.75Unverified
5SPIL ConvolutionAccuracy89.3Unverified
6Flow Gated NetworkAccuracy87.25Unverified
#ModelMetricClaimedVerifiedStatus
1FocusCLIPTop-3 Accuracy (%)10.47Unverified
2CLIPTop-3 Accuracy (%)6.49Unverified
#ModelMetricClaimedVerifiedStatus
1Boutaleb et al.1:1 Accuracy97.91Unverified
#ModelMetricClaimedVerifiedStatus
1all-landmark-modelActivity Recognition0.76Unverified