认知规律启发的显著性物体检测方法与评测

范登平著

《认知规律启发的显著性物体检测方法与评测》

作者：范登平著

单位：南开大学

论文名称	认知规律启发的显著性物体检测方法与评测
作者	范登平著
学科	软件工程. 计算机视觉
学位授予单位	南开大学
导师	刘晓光，程明明指导
出版年份	2019
中文摘要	显著性物体检测技术起源于认知学中人类的视觉注意行为，即人类视觉系统能够快速地将注意力转移到视觉场景中最具信息量的区域而有选择性地忽略其它区域。该技术在现实生活中有着广泛的应用基础，如，自动驾驶、人机互动、视频分割、视频字幕、视频压缩等。除了其学术价值和实际意义之外，由于图像和视频数据(遮挡、模糊、运动模式等)自身的挑战以及人类在动态场景中注意行为(选择性注意分配和注意转移)固有的复杂性，使得显著性物体检测技术面临着巨大挑战。受制于采集设备，早期构建的显著性物体检测数据集表达真实场景的能力非常有限。同时，这一领域的评价指标也是基于像素级误差的，完全忽略了人类认知规律的特性。上述问题，严重制约了显著性物体检测技术的发展。本文围绕图像视频显著性物体检测，研究了基于人类认知规律的数据集建立、模型建模、评价指标三个方向的问题。主要创新点包括： 1.针对现有图像显著性物体检测公开测试存在的各种偏差问题，构建了一个富上下文环境下的图像显著性物体检测数据集SOC，并首次从属性层面对现有方法进行了大量评测和深入的分析。 2.针对视频显著性物体检测中注意力转移的问题，构建了第一个高质量、稠密标注的视频显著性物体检测DAVSOD数据集；提出了基于注意力转移的SSAV模型，取得了国际领先的检测性能；提供了当前最大规模、最完整的视频显著性物体评测结果。 3.针对非二进制显著性物体检测质量评价的问题，提出了符合人类认知规律的度量指标S-measure，使得评价方法从像素-级过度到结构-级，特别是与人的主观评价一致性性能从低于50％提升到了77％。 4.针对二进制显著性物体检测质量评价的问题，提出了符合人类认知规律的度量指标E-measure，使得评价方法在一个紧凑项中同时考虑了全局和局部信息，上述方法相比国际最先进算法的性能提高了19％。关键词：显著性物体检测，评价指标，数据集，视频显著性，图像显著性
英文摘要	Salient object detection (SOD) originates from the cognitive studies of human visual attention behavior, i.e., the astonishing ability of the human visual system to quickly orient attention to the most informative parts of visual scenes and ignore the other parts. SOD is thus significantly instrumental to a wide range of real-world applications, e.g., autonomous driving, robotic interaction, video segmentation, video captioning, video compression. Besides its academic value and practical significance, SOD presents great difficulties due to the challenges carried by video data (e.g., occlusions, blur, large object-deformations, diverse motion patterns) and the inherent complexity of human visual attention behavior (i.e., selective attention allocation, attention shift) during dynamic scenes. Subject to the limitation of acquisition device, the early build salient object detection datasets do not represent the real scene well. Moreover, the evaluation metrics in this field ignore the properties of the human visual system and are all based on pixel-level error. The above problems have seriously restricted the development of salient object detection technology. This dissertation based on the cognitive theory and focuses on image and video salient object detection, the research directions including the collection of the dataset, the creation of the models and the design of evaluation metrics. The major contributions of the dissertation are: 1.My analysis points out various of serious data bias in existing SOD datasets. I built a new SOD dataset, called SOC which contains diversity context in the realistic environment. Then, a set of attributes (e.g., Appearance Change) is proposed in the attempt to obtain a deeper insight into the SOD problem. I also present the currently largest scale performance evaluation of CNNs based SOD models. 2.To further advance the research of the saliency-shift issue, I elaborately collected a high-quality Densely Annotated Video Salient Object Detection (DAVSOD) dataset. The proposed SSAV model performs better against other top competitors over the five large scale datasets. To further contribute the community a complete and the largest-scale benchmark, I systematically assess several representative video salient object detection algorithms. 3.To address the evaluation problem of the non-binary map, I propose a structure similarity based SOD measure, called S-measure. Rather than based on pixelwise error, the new measure based on structural similarity. Especially, the performance of human consistency has improved from 23% to 77%. 4.I propose a novel and effective Enhanced-alignment measure (E-measure) for binary salient object detection map. The motivation from the cognitive vision studies which have shown that human vision is highly sensitive to both global information and local details in scenes. Thus, the new measure achieve the largest improvement of 19% compared with other popular measures in terms of specific meta-measures. Key Words: salient object detection (SOD), evaluation metric, dataset, video saliency, image saliency

鸥维数据云查询平台

联系我们

电话：400-139-8015
微信：vbeiyou
邮箱：ovo@qudong.com
总部：北京市海淀区学院路30号科群大厦西楼5层

主要城市

主要县市区

主要国家

导航

Copyright © 西北大学西部大数据研究院旗下“鸥维数据” 京ICP备17065155号-6

客服回顶部