Every Shot Counts: Using Exemplars for Repetition Counting in Videos

2024-03-26

Saptarshi Sinha, Alexandros Stergiou, Dima Damen

Code

github.com/sinhasaptarshi/EveryShotCounts
Official, in paper, PyTorch, ★ 29

Abstract

Video repetition counting infers the number of repetitions of recurring actions or motion within a video. We propose an exemplar-based approach that discovers visual correspondence of video exemplars across repetitions within target videos. Our proposed Every Shot Counts (ESCounts) model is an attention-based encoder-decoder that encodes videos of varying lengths alongside exemplars from the same and different videos. In training, ESCounts regresses locations of high correspondence to the exemplars within the video. In tandem, our method learns a latent that encodes representations of general repetitive motions, which we use for exemplar-free, zero-shot inference. Extensive experiments over commonly used datasets (RepCount, Countix, and UCFRep) showcase ESCounts obtaining state-of-the-art performance across all three datasets. Detailed ablations further demonstrate the effectiveness of our method.
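As a rough illustration of how regressing "locations of high correspondence" can yield a count, the sketch below assumes a temporal density-map formulation: each repetition contributes a unit-mass Gaussian along the time axis, and the count is the sum of the map. The abstract does not spell out this formulation, so the functions `gaussian_density` and `count_from_density`, the `sigma` parameter, and the example repetition positions are all hypothetical and are not taken from the ESCounts code.

```python
import math


def gaussian_density(num_steps, centers, sigma=1.0):
    """Build a 1D density map with one unit-mass Gaussian per repetition.

    num_steps: number of temporal positions (e.g. video tokens); assumed value.
    centers:   temporal indices where a repetition occurs; assumed annotations.
    """
    density = [0.0] * num_steps
    for c in centers:
        weights = [math.exp(-0.5 * ((t - c) / sigma) ** 2) for t in range(num_steps)]
        total = sum(weights)
        for t, w in enumerate(weights):
            # Normalise so that each repetition contributes exactly 1 to the sum.
            density[t] += w / total
    return density


def count_from_density(density):
    """Recover the repetition count by summing the density map."""
    return sum(density)


# Hypothetical video with 64 temporal positions and three repetitions.
density = gaussian_density(64, centers=[10, 25, 40], sigma=1.5)
predicted_count = round(count_from_density(density))
```

Under this assumption, a model trained to regress such a map can be evaluated by summing its output, which keeps the count differentiable during training.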

Tasks

Decoder Exemplar-Free Repetitive Action Counting

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Countix	ESCounts	OBO	0.67	—	Unverified
RepCount	ESCounts	OBO	0.56	—	Unverified
UCFRep	ESCounts	MAE	0.22	—	Unverified
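The two metrics in the table can be sketched as follows, assuming the definitions commonly used in the repetition counting literature: OBO (off-by-one accuracy) is the fraction of videos whose predicted count is within one of the ground truth, and MAE is the absolute count error normalised by the ground-truth count, averaged over videos. The function names and the sample counts are hypothetical and are not results from the paper.

```python
def mean_absolute_error(predicted, actual):
    """Normalised MAE: mean of |pred - gt| / gt over videos (gt must be > 0)."""
    return sum(abs(p - g) / g for p, g in zip(predicted, actual)) / len(actual)


def off_by_one_accuracy(predicted, actual):
    """OBO: fraction of videos where |pred - gt| <= 1."""
    hits = sum(1 for p, g in zip(predicted, actual) if abs(p - g) <= 1)
    return hits / len(actual)


# Made-up counts for four videos, for illustration only.
predicted = [4, 10, 7, 3]
actual = [4, 8, 6, 6]

mae = mean_absolute_error(predicted, actual)
obo = off_by_one_accuracy(predicted, actual)
```

Lower is better for MAE and higher is better for OBO, which is why the table reports them separately.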
