Deep gaze pooling: Inferring and visually decoding search intents from human gaze fixations
journal contribution
posted on 2023-11-29, 18:06authored byHosnieh Sattara, Mario FritzMario Fritz, Andreas Bulling
Predicting the target of visual search from human eye fixations (gaze) is a difficult problem with many applications, e.g. in human-computer interaction. While previous work has focused on predicting specific search target instances, we propose the first approach to predict categories and attributes of search intents from gaze data and to visually reconstruct plausible targets. However, state-of-the-art models for categorical recognition, in general, require large amounts of training data, which is prohibitive for gaze data. To address this challenge, we further propose a novel Gaze Pooling Layer that combines gaze information with visual representations from Deep Learning approaches. Our scheme incorporates both spatial and temporal aspects of human gaze behavior as well as the appearance of the fixated locations. We propose an experimental setup and novel dataset and demonstrate the effectiveness of our method for gaze-based search target prediction and reconstruction. We highlight several practical advantages of our approach, such as compatibility with existing architectures, no need for gaze training data, and robustness to noise from common gaze sources.
History
Preferred Citation
Hosnieh Sattara, Mario Fritz and Andreas Bulling. Deep gaze pooling: Inferring and visually decoding search intents from human gaze fixations. In: Neurocomputing. 2020.
Primary Research Area
Algorithmic Foundations and Cryptography
Legacy Posted Date
2020-10-15
Journal
Neurocomputing
Pages
369 - 382
Open Access Type
Unknown
Sub Type
Article
BibTeX
@article{cispa_all_3249,
title = "Deep gaze pooling: Inferring and visually decoding search intents from human gaze fixations",
author = "Sattara, Hosnieh and Fritz, Mario and Bulling, Andreas",
journal="{Neurocomputing}",
year="2020",
}