Reconstructing the house from the ad: Structured prediction on real estate classifieds
Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/bekou/ad_dataOfficialIn papernone★ 0
Abstract
In this paper, we address the (to the best of our knowledge) new problem of extracting a structured description of real estate properties from their natural language descriptions in classifieds. We survey and present several models to (a) identify important entities of a property (e.g.,rooms) from classifieds and (b) structure them into a tree format, with the entities as nodes and edges representing a part-of relation. Experiments show that a graph-based system deriving the tree from an initially fully connected entity graph, outperforms a transition-based system starting from only the entity nodes, since it better reconstructs the tree.