An Evaluation of Image-Based Verb Prediction Models against Human Eye-Tracking Data

2018-06-01NAACL 2018Unverified0· sign in to hype

Sp Gella, ana, Frank Keller

Unverified — Be the first to reproduce this paper.

Abstract

Recent research in language and vision has developed models for predicting and disambiguating verbs from images. Here, we ask whether the predictions made by such models correspond to human intuitions about visual verbs. We show that the image regions a verb prediction model identifies as salient for a given verb correlate with the regions fixated by human observers performing a verb classification task.

Tasks

General Classification Question Answering Visual Question Answering (VQA)Word Sense Disambiguation

An Evaluation of Image-Based Verb Prediction Models against Human Eye-Tracking Data

Abstract

Tasks

Reproductions