In the field of computer vision and image recognition, enabling the computer to discern target features while filtering out irrelevant ones poses a challenge. Drawing insights from studies in biological vision, we find that there is a local visual acuity mechanism and a visual focus mechanism in the initial conversion and processing of visual information, ensuring that the visual system can give ear to salient features of the target in the early visual observation phase. Inspired by this, we build a novel image recognition network to focus on the target features while ignoring other irrelevant features in the image, and further focus on the focus features based on the target features. Meanwhile, in order to comply with the output characteristics when similar features exist in different categories, we design a softer image label operation for similar features in different categories, which solves the correlation of labels between categories. Relevant experimental findings underscore the efficacy of our proposed method, revealing discernible advantages in comparison to alternative approaches. Visualization results further attest to the method's capability to selectively focus on pertinent target features within the image, sidelining extraneous information.
Keywords: Bio-inspired intelligence; Deep-learning; Image recognition; Visual focus mechanism; Visual local acuity mechanism.
Copyright © 2024 Elsevier Ltd. All rights reserved.