An Effortless Way To Create Large-Scale Datasets For Famous Speakers

2014-05-01LREC 2014Unverified0· sign in to hype

Fran{\c{c}}ois Salmon, F{\'e}licien Vallet

Unverified — Be the first to reproduce this paper.

Abstract

The creation of large-scale multimedia datasets has become a scientific matter in itself. Indeed, the fully-manual annotation of hundreds or thousands of hours of video and/or audio turns out to be practically infeasible. In this paper, we propose an extremly handy approach to automatically construct a database of famous speakers from TV broadcast news material. We then run a user experiment with a correctly designed tool that demonstrates that very reliable results can be obtained with this method. In particular, a thorough error analysis demonstrates the value of the approach and provides hints for the improvement of the quality of the dataset.

Tasks

Person Identification Speaker Diarization Speaker Recognition

An Effortless Way To Create Large-Scale Datasets For Famous Speakers

Abstract

Tasks

Reproductions