FastThaiCaps: A Transformer based Capsule Network for Hate Speech Detection in Thai Language

50

Views

0

Downloads

Maity, Krishanu, Bhattacharya, Shaubhik, Saha, Sriparna, Janoai, Suwika and Pasupa, Kitsuchart (2022) FastThaiCaps: A Transformer based Capsule Network for Hate Speech Detection in Thai Language In: International Conference on Neural Information Processing (ICONIP 2022) Lecture Notes in Computer Science, 13624 Springer Nature, 425-42537.

Abstract

The advent of technology has led to people sharing their views openly like never before. Parallelly, cyberbullying and hate speech content have also increased as a side effect that is potentially hazardous to society. While plenty of research is going on to detect online hate speech in English, there is very little research on the Thai language. To investigate how noisy Thai posts can be handled effectively, in this work, we have developed a two-channel deep learning model FastThaiCaps based on BERT and FastText embedding along with a capsule network. The input to one channel is the BERT language model, and that to the other is the pre-trained FastText embedding. Our model has been evaluated on a benchmark Thai dataset categorized into four categories, i.e., peace speech, neutral speech, level-1 hate speech, and level-2 hate speech. Experiments show that FastThaiCaps outperforms state-of-the-art methods by up to 3.11% in terms F1 score.

Item Type:

Book Section

Identification Number (DOI):

Subjects:

Subjects > Computer Science > Artificial Intelligence

Subjects > Computer Science > Computation and Language (Computational Linguistics and Natural Language and Speech Processing)

Subjects > Computer Science > Machine Learning

Deposited by:

Kitsuchart Pasupa

Date Deposited:

2023-06-19 22:50:24

Last Modified:

2023-07-24 22:17:50

Impact and Interest:

Statistics