Small, non-stationary, and hidden objects are often underrepresented in large-scale scene databases

Tue—Casino_1.801—Poster2—5302

Presented by: Feron Y. Basoeki

Feron Y. Basoeki ¹^*, Sandro L. Wiesmann ¹, Melissa L.-H. Võ ^{1, 2}

¹ Goethe University Frankfurt, Department of Psychology, Scene Grammar Lab, Germany, ² Neuro-Cognitive Psychology, Department of Psychology, LMU Munich

Objects play a crucial role in our ability to categorize scenes. Two object properties are particularly important for predicting scene categorization accuracy: Object frequency and specificity for the given scene category. These object-occurrence statistics are often estimated from large-scale databases of labelled scene images. However, database measures do not always correlate with human ratings of specificity and frequency, opening up the question which of the two – databases or human ratings – better represent object-occurrence statistics in the real world. Here we therefore compared object frequency and specificity estimates gathered from human ratings (6-point Likert scale), scene image databases versus real-world environments (i.e., accurate object counts). Participants (N = 91) were asked to report the presence and number of 79 pre-selected objects in different rooms of their homes and rate their visibility and usage frequency. Intriguingly, while specificity measures were relatively consistent across participants’ self-reports and scene databases, we found significant differences between self-reports and databases with regard to object frequency. Especially smaller, non-stationary objects such as consumable goods (e.g., shampoo) and objects that are usually “hidden” or stored away (e.g., dishes or detergent) were underrepresented in scene image databases. Nonetheless, averaged human-rated object-occurrence statistics still outperformed both real-world and database statistics in predicting scene categorization accuracy, suggesting that scene categorization is based on prototypical object representations rather than real-world occurrence statistics. We therefore suggest considering the suitability of each source of object-occurrence statistics depending on the researchers’ goal and argue for more realistic and representative real-world scene image databases.

Keywords: scene categorization, object-occurrence, scene databases, object frequency