SOTAVerified

Suggesting Sentences for ESL using Kernel Embeddings

2017-12-01WS 2017Unverified0· sign in to hype

Kent Shioda, Mamoru Komachi, Rue Ikeya, Daichi Mochihashi

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Sentence retrieval is an important NLP application for English as a Second Language (ESL) learners. ESL learners are familiar with web search engines, but generic web search results may not be adequate for composing documents in a specific domain. However, if we build our own search system specialized to a domain, it may be subject to the data sparseness problem. Recently proposed word2vec partially addresses the data sparseness problem, but fails to extract sentences relevant to queries owing to the modeling of the latent intent of the query. Thus, we propose a method of retrieving example sentences using kernel embeddings and N-gram windows. This method implicitly models latent intent of query and sentences, and alleviates the problem of noisy alignment. Our results show that our method achieved higher precision in sentence retrieval for ESL in the domain of a university press release corpus, as compared to a previous unsupervised method used for a semantic textual similarity task.

Tasks

Reproductions