Complex Verbs are Different: Exploring the Visual Modality in Multi-Modal Models to Predict Compositionality
2017-04-01 · WS 2017
Maximilian Köper, Sabine Schulte im Walde
Abstract
This paper compares a neural-network distributional semantic model (DSM) relying on textual co-occurrences with a multi-modal model integrating visual information. We focus on nominal vs. verbal compounds, and zoom into lexical, empirical and perceptual target properties to explore the contribution of the visual modality. Our experiments show that (i) visual features contribute differently for verbs than for nouns, and (ii) images complement textual information if (a) the textual modality by itself is poor and appropriate image subsets are used, or (b) the textual modality by itself is rich and large (potentially noisy) images are added.
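
To make the comparison in the abstract concrete, the following is a minimal sketch of one common way to build a multi-modal representation and score compositionality. It is not taken from the paper. It assumes that each modality vector is L2-normalized, that the two are concatenated with a weight `alpha`, and that compositionality is predicted as the cosine similarity between a compound and one of its constituents. The function names, the `alpha` parameter and the toy vectors are all hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse(text_vec, image_vec, alpha=0.5):
    """Concatenate normalized textual and visual vectors.

    alpha weights the textual part and (1 - alpha) the visual part.
    This weighting scheme is an assumption, not the paper's stated setup.
    """
    text_part = [alpha * x for x in l2_normalize(text_vec)]
    image_part = [(1.0 - alpha) * x for x in l2_normalize(image_vec)]
    return text_part + image_part


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def compositionality_score(compound_vec, constituent_vec):
    """Higher similarity to a constituent is read as higher compositionality."""
    return cosine(compound_vec, constituent_vec)


# Toy vectors (invented for illustration only).
compound_text, compound_image = [1.0, 0.0], [0.0, 1.0]
constituent_text, constituent_image = [1.0, 0.0], [1.0, 0.0]

text_only = compositionality_score(compound_text, constituent_text)
multi_modal = compositionality_score(
    fuse(compound_text, compound_image),
    fuse(constituent_text, constituent_image),
)
print(text_only, multi_modal)
```

In this toy case the textual vectors agree while the visual vectors disagree, so adding the visual modality lowers the score. Setting `alpha=1.0` recovers the text-only prediction, which gives a simple way to compare the two model types on the same targets.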