A Modified Binary Flower Pollination Algorithm: A Fast and Effective Combination of Feature Selection Techniques for SNP Classification

342

Views

0

Downloads

Rathasamuth, Wanthanee and Pasupa, Kitsuchart (2019) A Modified Binary Flower Pollination Algorithm: A Fast and Effective Combination of Feature Selection Techniques for SNP Classification In: 2019 11th International Conference on Information Technology and Electrical Engineering (ICITEE), 2019-10-10, Pattaya, Thailand.

Abstract

Single nucleotide polymorphism (SNP) is a genetic trait responsible for the differences in the characteristics of individuals of a living species. Machine learning has been brought in to classify swine breed according to their SNPs. However, since the number of samples (number of pigs sampled) is usually much smaller than the number of features (SNPs) to classify, there may occur an overfitting problem. Therefore, some feature selection techniques were applied to the entire SNPs to reduce them to a much smaller number of most significant SNPs to be used in the classification. In this study, we used information gain in combination with binary flower pollination algorithm for feature selection as well as a cut-off-point-finding threshold for specifying a 0 or 1 value for a position in the solution vector and a GA bit-flip mutation operator. We called it Modified-BFPA. The classifier was SVM. Evaluated against a few other feature selection techniques, our combination of techniques was, at the very least, competitive to those. It selected only 1.76 % of most significant SNPs from the entire set of 10,210 SNPs. The SNPs that it selected provided 95.12 % classification accuracy. Moreover, it was fast: an average of 1.60 iterations in combination with SVM to find a set of best SNPs that provided the highest classification accuracy.

Item Type:

Conference or Workshop Item (Paper)

Identification Number (DOI):

Deposited by:

ระบบ อัตโนมัติ

Date Deposited:

2021-09-09 23:53:43

Last Modified:

2021-09-20 02:15:39

Impact and Interest:

Statistics