You were saying? - Spoken Language in the V3C Dataset

2022-12-15 · Code Available

Luca Rossetto

Abstract

This paper presents an analysis of the distribution of spoken language in the V3C video retrieval benchmark dataset based on automatically generated transcripts. It finds that a large portion of the dataset is covered by spoken language. Since language transcripts can be quickly and accurately described, this has implications for retrieval tasks such as known-item search.
