A Comparison of Bottom-Up Models for Spatial Saliency Predictions in Autonomous Driving
by ,
Abstract:
Bottom-up saliency models identify the salient regions of an image based on features such as color, intensity and orientation. These models are typically used as predictors of human visual behavior and for computer vision tasks. In this paper, we conduct a systematic evaluation of the saliency maps computed with four selected bottom-up models on images of urban and highway traffic scenes. Saliency both over whole images and on object level is investigated and elaborated in terms of the energy and the entropy of the saliency maps. We identify significant differences with respect to the amount, size and shape-complexity of the salient areas computed by different models. Based on these findings, we analyze the likelihood that object instances fall within the salient areas of an image and investigate the agreement between the segments of traffic participants and the saliency maps of the different models. The overall and object-level analysis provides insights on the distinctive features of salient areas identified by different models, which can be used as selection criteria for prospective applications in autonomous driving such as object detection and tracking.
Reference:
A Comparison of Bottom-Up Models for Spatial Saliency Predictions in Autonomous Driving (Jaime Maldonado, Lino Antoni Giefer), In Sensors, volume 21, 2021.
Bibtex Entry:
@Article{Maldonado_etal_Sensors_2021,
author = {Maldonado, Jaime and Giefer, Lino Antoni},
title = {A Comparison of Bottom-Up Models for Spatial Saliency Predictions in Autonomous Driving},
journal = {Sensors},
volume = {21},
year = {2021},
number = {20},
article-number = {6825},
url = {https://www.mdpi.com/1424-8220/21/20/6825},
issn = {1424-8220},
abstract = {Bottom-up saliency models identify the salient regions of an image based on features such as color, intensity and orientation. These models are typically used as predictors of human visual behavior and for computer vision tasks. In this paper, we conduct a systematic evaluation of the saliency maps computed with four selected bottom-up models on images of urban and highway traffic scenes. Saliency both over whole images and on object level is investigated and elaborated in terms of the energy and the entropy of the saliency maps. We identify significant differences with respect to the amount, size and shape-complexity of the salient areas computed by different models. Based on these findings, we analyze the likelihood that object instances fall within the salient areas of an image and investigate the agreement between the segments of traffic participants and the saliency maps of the different models. The overall and object-level analysis provides insights on the distinctive features of salient areas identified by different models, which can be used as selection criteria for prospective applications in autonomous driving such as object detection and tracking.},
DOI = {10.3390/s21206825},
keywords = {proreta}
}