Neural Variational Inference for Text Processing

2015-11-19Code Available0· sign in to hype

Yishu Miao, Lei Yu, Phil Blunsom

Code Available — Be the first to reproduce this paper.

Code

github.com/ysmiao/nvdm
tf★ 0
github.com/shining-spring/nvlda
tf★ 0
github.com/carpedm20/variational-text-tensorflow
tf★ 0
github.com/jainshobhit/Variational-Autoencoder
tf★ 0
github.com/YongfeiYan/Neural-Document-Modeling
pytorch★ 0
github.com/jiacheng-xu/vmf_vae_nlp
pytorch★ 0

Abstract

Recent advances in neural variational inference have spawned a renaissance in deep latent variable models. In this paper we introduce a generic variational inference framework for generative and conditional models of text. While traditional variational methods derive an analytic approximation for the intractable distributions over latent variables, here we construct an inference network conditioned on the discrete text input to provide the variational distribution. We validate this framework on two very different text modelling applications, generative document modelling and supervised question answering. Our neural variational document model combines a continuous stochastic document representation with a bag-of-words generative model and achieves the lowest reported perplexities on two standard test corpora. The neural answer selection model employs a stochastic representation layer within an attention mechanism to extract the semantics between a question and answer pair. On two question answering benchmarks this model exceeds all previous published benchmarks.

Tasks

Answer Selection Question Answering Topic Models Variational Inference

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
QASent	Attentive LSTM	MAP	0.73	—	Unverified
QASent	LSTM (lexical overlap + dist output)	MAP	0.72	—	Unverified
QASent	LSTM	MAP	0.64	—	Unverified
WikiQA	Attentive LSTM	MAP	0.69	—	Unverified
WikiQA	LSTM (lexical overlap + dist output)	MAP	0.68	—	Unverified
WikiQA	LSTM	MAP	0.66	—	Unverified

Neural Variational Inference for Text Processing

Code

Abstract

Tasks

Benchmark Results

Reproductions