You were saying? - Spoken Language in the V3C Dataset
2022-12-15Code Available0· sign in to hype
Luca Rossetto
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/lucaro/v3c-language-analysisOfficialIn papernone★ 0
Abstract
This paper presents an analysis of the distribution of spoken language in the V3C video retrieval benchmark dataset based on automatically generated transcripts. It finds that a large portion of the dataset is covered by spoken language. Since language transcripts can be quickly and accurately described, this has implications for retrieval tasks such as known-item search.