Spatial-Frequency Redistribution-based Saliency Region Detection for Thai Text Localisation

165

Views

0

Downloads

Titijaroonroj, Taravichet, Suttapakti, Ungsumalee and Nunsong, Walairach (2022) Spatial-Frequency Redistribution-based Saliency Region Detection for Thai Text Localisation ECTI Transactions on Computer and Information Technology, 16 (1)., 109-10924. ISSN 22869131

Abstract

Saliency region detection plays an important role in computer vision applications and related areas, such as human fixation, figure-ground separation, face detection, and image compression. Nevertheless, state-of-the-art methods can moderately detect the saliency regions of a psychological pattern dataset. Therefore, this paper proposes a spatial-frequency redistribution (SFR) method to improve the efficiency of the detection of the saliency regions. The proposed SFR method consists of two main procedures: (i) adaptive cosine image construction-and-implantation and (ii) saliency region detection using complete multiple-object-implanted images. The former procedure constructs an adaptive cosine image by using a cosine function based on an object structure and then individually implants it into each object detectable. This stage provides the first significant property, spatial redistribution, to the object implanted. The adaptive cosine image redistributes the objects for controllability. Then, the latter procedure transforms a complete multiple object-implanted image into the frequency domain. At this point, the second significant property, frequency redistribution, provides the simple technique for identifying and separating the target object from the unwanted objects and background. In this paper, these properties are referred to as spatial-frequency redistribution. This method was realized as a computer program, and then such program was tested with a psychological pattern dataset and a Thai text dataset. The experimental results, when compared with state-of-the-art methods, show that the proposed SFR method can achieve the clear detection of the attention region in the psychological pattern dataset. Moreover, the proposed method can achieve a F-value of 80.28% with a recall and precision of 76.32% and 85.23%, respectively, for Thai text localisation in the Thai text dataset.

Item Type:

Article

Subjects:

Subjects > Computer Science > Computer Vision and Pattern Recognition

Deposited by:

Taravichet Titijaroonroj

Date Deposited:

2022-06-30 15:41:28

Last Modified:

2022-08-20 05:45:14

Impact and Interest:

Statistics