Neural Variational Inference for Text Processing
Yishu Miao, Lei Yu, Phil Blunsom
Code Available — Be the first to reproduce this paper.
ReproduceCode
Abstract
Recent advances in neural variational inference have spawned a renaissance in deep latent variable models. In this paper we introduce a generic variational inference framework for generative and conditional models of text. While traditional variational methods derive an analytic approximation for the intractable distributions over latent variables, here we construct an inference network conditioned on the discrete text input to provide the variational distribution. We validate this framework on two very different text modelling applications, generative document modelling and supervised question answering. Our neural variational document model combines a continuous stochastic document representation with a bag-of-words generative model and achieves the lowest reported perplexities on two standard test corpora. The neural answer selection model employs a stochastic representation layer within an attention mechanism to extract the semantics between a question and answer pair. On two question answering benchmarks this model exceeds all previous published benchmarks.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| QASent | Attentive LSTM | MAP | 0.73 | — | Unverified |
| QASent | LSTM (lexical overlap + dist output) | MAP | 0.72 | — | Unverified |
| QASent | LSTM | MAP | 0.64 | — | Unverified |
| WikiQA | Attentive LSTM | MAP | 0.69 | — | Unverified |
| WikiQA | LSTM (lexical overlap + dist output) | MAP | 0.68 | — | Unverified |
| WikiQA | LSTM | MAP | 0.66 | — | Unverified |