EHNet: Efficient Hybrid Network with Dual Attention for Image Deblurring

Quoc-Thien Ho; Minh-Thien Duong; Seongsoo Lee; Min-Cheol Hong

doi:10.3390/s24206545

EHNet: Efficient Hybrid Network with Dual Attention for Image Deblurring

Sensors (Basel). 2024 Oct 10;24(20):6545. doi: 10.3390/s24206545.

Authors

Quoc-Thien Ho¹, Minh-Thien Duong², Seongsoo Lee³, Min-Cheol Hong⁴

Affiliations

¹ Department of Information and Telecommunication Engineering, Soongsil University, Seoul 06978, Republic of Korea.
² Department of Automatic Control, Ho Chi Minh City University of Technology and Education, Ho Chi Minh City 70000, Vietnam.
³ Department of Intelligent Semiconductor, Soongsil University, Seoul 06978, Republic of Korea.
⁴ School of Electronic Engineering, Soongsil University, Seoul 06978, Republic of Korea.

Abstract

The motion of an object or camera platform makes the acquired image blurred. This degradation is a major reason to obtain a poor-quality image from an imaging sensor. Therefore, developing an efficient deep-learning-based image processing method to remove the blur artifact is desirable. Deep learning has recently demonstrated significant efficacy in image deblurring, primarily through convolutional neural networks (CNNs) and Transformers. However, the limited receptive fields of CNNs restrict their ability to capture long-range structural dependencies. In contrast, Transformers excel at modeling these dependencies, but they are computationally expensive for high-resolution inputs and lack the appropriate inductive bias. To overcome these challenges, we propose an Efficient Hybrid Network (EHNet) that employs CNN encoders for local feature extraction and Transformer decoders with a dual-attention module to capture spatial and channel-wise dependencies. This synergy facilitates the acquisition of rich contextual information for high-quality image deblurring. Additionally, we introduce the Simple Feature-Embedding Module (SFEM) to replace the pointwise and depthwise convolutions to generate simplified embedding features in the self-attention mechanism. This innovation substantially reduces computational complexity and memory usage while maintaining overall performance. Finally, through comprehensive experiments, our compact model yields promising quantitative and qualitative results for image deblurring on various benchmark datasets.

Keywords: Transformer; convolution neural networks; dual attention module; hybrid architecture; image deblurring; motion blur.

Abstract

Grants and funding