Hierarchical Sketch Induction for Paraphrase Generation
Tom Hosking, Hao Tang, Mirella Lapata
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/tomhosking/hrq-vaeOfficialIn paperpytorch★ 51
Abstract
We propose a generative model of paraphrase generation, that encourages syntactic diversity by conditioning on an explicit syntactic sketch. We introduce Hierarchical Refinement Quantized Variational Autoencoders (HRQ-VAE), a method for learning decompositions of dense encodings as a sequence of discrete latent variables that make iterative refinements of increasing granularity. This hierarchy of codes is learned through end-to-end training, and represents fine-to-coarse grained information about the input. We use HRQ-VAE to encode the syntactic form of an input sentence as a path through the hierarchy, allowing us to more easily predict syntactic sketches at test time. Extensive experiments, including a human evaluation, confirm that HRQ-VAE learns a hierarchical representation of the input space, and generates paraphrases of higher quality than previous systems.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| MSCOCO | HRQ-VAE | BLEU | 27.9 | — | Unverified |
| Paralex | HRQ-VAE | iBLEU | 24.93 | — | Unverified |
| Quora Question Pairs | HRQ-VAE | iBLEU | 18.42 | — | Unverified |