Mimicking visual searching with integrated top down cues and low-level features

Xu, Jiawei and Yue, Shigang (2014) Mimicking visual searching with integrated top down cues and low-level features. Neurocomputing, 133 . pp. 1-17. ISSN 0925-2312


Request a copy
[img] PDF
1-s2.0-S0925231214001131-main.pdf - Whole Document
Restricted to Repository staff only

Item Type:Article
Item Status:Live Archive


Visual searching is a perception task involved with visual attention, attention shift and active scan of the visual environment for a particular object or feature. The key idea of our paper is to mimic the human visual searching under the static and dynamic scenes. To build up an artificial vision system that performs the visual searching could be helpful to medical and psychological application development to human machine interaction. Recent state-of-the-art researches focus on the bottom-up and top-down saliency maps. Saliency maps indicate that the saliency likelihood of each pixel, however, understanding the visual searching process can help an artificial vision system exam details in a way similar to human and they will be good for future robots or machine vision systems which is a deeper digest than the saliency map. This paper proposed a computational model trying to mimic human visual searching process and we emphasis the motion cues on the visual processing and searching. Our model analysis the attention shifts by fusing the top-down bias and bottom-up cues. This model also takes account the motion factor into the visual searching processing. The proposed model involves five modules: the pre-learning process; top-down biasing; bottom-up mechanism; multi-layer neural network and attention shifts. Experiment evaluation results via benchmark databases and real-time video showed the model demonstrated high robustness and real-time ability under complex dynamic scenes.

Keywords:Visual searching, Learning bases, Top-down cues, Motion sensitive neurons, Multi-layer neural network, NotOAChecked
Subjects:G Mathematical and Computer Sciences > G740 Computer Vision
Divisions:College of Science > School of Computer Science
Related URLs:
ID Code:13453
Deposited On:04 Mar 2014 12:58

Repository Staff Only: item control page