TY - JOUR
T1 - Proto-Object Based Saliency Model With Texture Detection Channel
AU - Uejima, Takeshi
AU - Niebur, Ernst
AU - Etienne-Cummings, Ralph
N1 - Funding Information:
We would like to thank Rüdiger von der Heydt for sharing his deep insights on neuroscience and vision with us. We also thank the National Science Foundation and the National Institutes of Health which support this work through the CRCNS mechanism. TU's Ph.D. training was supported by the Japanese Acquisition, Technology & Logistic Agency, Government of Japan.
Publisher Copyright:
© Copyright © 2020 Uejima, Niebur and Etienne-Cummings.
PY - 2020/9/24
Y1 - 2020/9/24
N2 - The amount of visual information projected from the retina to the brain exceeds the information processing capacity of the latter. Attention, therefore, functions as a filter to highlight important information at multiple stages of the visual pathway that requires further and more detailed analysis. Among other functions, this determines where to fixate since only the fovea allows for high resolution imaging. Visual saliency modeling, i.e. understanding how the brain selects important information to analyze further and to determine where to fixate next, is an important research topic in computational neuroscience and computer vision. Most existing bottom-up saliency models use low-level features such as intensity and color, while some models employ high-level features, like faces. However, little consideration has been given to mid-level features, such as texture, for visual saliency models. In this paper, we extend a biologically plausible proto-object based saliency model by adding simple texture channels which employ nonlinear operations that mimic the processing performed by primate visual cortex. The extended model shows statistically significant improved performance in predicting human fixations compared to the previous model. Comparing the performance of our model with others on publicly available benchmarking datasets, we find that our biologically plausible model matches the performance of other models, even though those were designed entirely for maximal performance with little regard to biological realism.
AB - The amount of visual information projected from the retina to the brain exceeds the information processing capacity of the latter. Attention, therefore, functions as a filter to highlight important information at multiple stages of the visual pathway that requires further and more detailed analysis. Among other functions, this determines where to fixate since only the fovea allows for high resolution imaging. Visual saliency modeling, i.e. understanding how the brain selects important information to analyze further and to determine where to fixate next, is an important research topic in computational neuroscience and computer vision. Most existing bottom-up saliency models use low-level features such as intensity and color, while some models employ high-level features, like faces. However, little consideration has been given to mid-level features, such as texture, for visual saliency models. In this paper, we extend a biologically plausible proto-object based saliency model by adding simple texture channels which employ nonlinear operations that mimic the processing performed by primate visual cortex. The extended model shows statistically significant improved performance in predicting human fixations compared to the previous model. Comparing the performance of our model with others on publicly available benchmarking datasets, we find that our biologically plausible model matches the performance of other models, even though those were designed entirely for maximal performance with little regard to biological realism.
KW - image texture analysis
KW - neuromorphic engineering
KW - proto-object
KW - saliency
KW - visual attention
UR - http://www.scopus.com/inward/record.url?scp=85092187848&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85092187848&partnerID=8YFLogxK
U2 - 10.3389/fncom.2020.541581
DO - 10.3389/fncom.2020.541581
M3 - Article
C2 - 33071766
AN - SCOPUS:85092187848
SN - 1662-5188
VL - 14
JO - Frontiers in Computational Neuroscience
JF - Frontiers in Computational Neuroscience
M1 - 541581
ER -