BUCEADOR, a multi-language search engine for digital libraries
Jordi Adell, Antonio Bonafonte, Antonio Cardenal, Marta R. Costa-juss{\`a}, Jos{\'e} A. R. Fonollosa, Asunci{\'o}n Moreno, Eva Navas, Eduardo R. Banga
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
This paper presents a web-based multimedia search engine built within the Buceador (www.buceador.org) research project. A proof-of-concept tool has been implemented which is able to retrieve information from a digital library made of multimedia documents in the 4 official languages in Spain (Spanish, Basque, Catalan and Galician). The retrieved documents are presented in the user language after translation and dubbing (the four previous languages + English). The paper presents the tool functionality, the architecture, the digital library and provide some information about the technology involved in the fields of automatic speech recognition, statistical machine translation, text-to-speech synthesis and information retrieval. Each technology has been adapted to the purposes of the presented tool as well as to interact with the rest of the technologies involved.