Segmentation of historical Lanna handwritten manuscripts

Pravesjit, Sakkayaphop and Thammano, Arit (2012) Segmentation of historical Lanna handwritten manuscripts. In: 2012 6th IEEE International Conference Intelligent Systems (IS), 2012-09-06, Sofia, Bulgaria.

Abstract

Lanna script is an archaic script that is not commonly used today. Few people can still read or write it, so anyone trying to read an old Lanna manuscript needs some form of translation help to understand what it says. A character recognition system is therefore needed to translate Lanna script into a commonly used script. The poor condition of the manuscripts and the writing style of the script make this problem very difficult to solve. The most difficult aspects of the writing style are touching and overlapping characters. For this reason, the first two stages of the character recognition process, image preprocessing and segmentation, must be handled carefully if the recognition accuracy is to be high. In this paper, two new techniques are proposed. The first technique focuses on converting a grayscale image to a binary image; it combines the concepts of the multithresholding method and Otsu's method. The second technique focuses on the segmentation of touching characters. Bounding box analysis is first employed to segment the document image into images of isolated characters and images of touching characters. A thinning algorithm is then applied to extract the skeleton of the touching characters. Next, using the junction points as separation points, the skeleton of the touching characters is separated into several pieces. Finally, the separated pieces are reassembled to reconstruct two isolated characters. The proposed algorithm achieves an accuracy of 86.67%.
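The abstract does not say how multithresholding and Otsu's method are combined, nor which thinning algorithm is used, so the following is only a minimal sketch of two standard building blocks it mentions: plain Otsu thresholding and junction-point detection on a skeleton that has already been thinned. The function names `otsu_threshold` and `junction_points` are hypothetical, and the crossing-number test for junctions is an assumption, not necessarily the authors' criterion.

```python
def otsu_threshold(pixels, levels=256):
    """Standard Otsu: return the gray level t that maximizes the
    between-class variance when pixels <= t form one class.
    `pixels` is a flat list of integers in [0, levels)."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of pixels in the class at or below t
    sum0 = 0    # sum of gray levels in that class
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue            # lower class still empty
        w1 = total - w0
        if w1 == 0:
            break               # upper class empty from here on
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def junction_points(skel):
    """Return (row, col) of skeleton pixels where three or more branches
    meet. `skel` is a list of rows of 0/1 values, assumed one pixel wide.
    Assumption: a junction is a pixel whose 8-neighbour ring shows at
    least three 0 -> 1 transitions (the crossing-number test)."""
    h, w = len(skel), len(skel[0])
    # 8 neighbours in clockwise order, starting from north.
    ring = [(-1, 0), (-1, 1), (0, 1), (1, 1),
            (1, 0), (1, -1), (0, -1), (-1, -1)]

    def at(r, c):
        # Pixels outside the image count as background.
        return skel[r][c] if 0 <= r < h and 0 <= c < w else 0

    found = []
    for r in range(h):
        for c in range(w):
            if not skel[r][c]:
                continue
            vals = [at(r + dr, c + dc) for dr, dc in ring]
            transitions = sum(
                1 for i in range(8)
                if vals[i] == 0 and vals[(i + 1) % 8] == 1
            )
            if transitions >= 3:
                found.append((r, c))
    return found


if __name__ == "__main__":
    # Two well-separated gray levels: the threshold falls at the darker one.
    print(otsu_threshold([10] * 50 + [200] * 50))   # 10

    # A plus-shaped skeleton has a single junction at its centre.
    plus = [
        [0, 0, 1, 0, 0],
        [0, 0, 1, 0, 0],
        [1, 1, 1, 1, 1],
        [0, 0, 1, 0, 0],
        [0, 0, 1, 0, 0],
    ]
    print(junction_points(plus))                    # [(2, 2)]
```

In a pipeline like the one described, the junction coordinates would serve as candidate cut points for splitting the skeleton of a touching pair into pieces. How those pieces are then regrouped into two characters is not specified in the abstract and is left out here.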

Item Type:

Conference or Workshop Item (Paper)

Identification Number (DOI):

Deposited by:

Automated system

Date Deposited:

2021-09-09 23:53:46

Last Modified:

2021-10-06 09:47:20
